Job Title
Principal Systems Software Engineer, LPU
Role Summary
Lead technical direction for LPX system software: hardware abstraction layers, core libraries, drivers, and runtimes implemented in Rust. Collaborate with hardware, compiler, and operations teams to design APIs, drive bring-up, and improve system reliability.
The team builds memory-safe, long-lived platform software that bridges novel silicon and higher-level workloads.
Experience Level
Senior: 12+ years building production system software.
Responsibilities
Primary responsibilities include architecture, implementation, and leadership across low-level system software:
- Define architecture and own API contracts for HALs and core system libraries.
- Design and implement drivers, runtimes, and data-movement pipelines for novel silicon.
- Build runtime interfaces for launching, monitoring, and managing production-scale workloads.
- Lead triage of complex sequencing, initialization, and cross-component runtime failures; produce root-cause analyses that change system design.
- Lead hardware bring-up and new product introduction (NPI) in partnership with hardware, compiler, and data center teams.
- Establish agent-assisted engineering practices, reusable abstractions, diagnostics, and documentation to raise team throughput.
- Communicate architecture and design tradeoffs clearly to engineering and executive audiences.
Requirements
Must-have technical skills and experience:
- 12+ years building production system software.
- Deep systems-programming expertise with Rust as the primary language for low-level work; production experience shipping Rust at the hardware or kernel boundary (drivers, firmware, runtimes).
- Track record designing and evolving long-lived libraries and APIs with ABI and compatibility discipline.
- Experience with large, multi-repository codebases and layered dependencies.
- Demonstrated leadership driving triage of difficult reliability issues to clear, written root-cause analysis.
- Low-level platform experience: firmware and boot flows, RTOS, BMCs/MCUs, RISC-V, or closely related system software.
- Linux driver or kernel-adjacent experience (for example, VFIO or similar subsystems).
- Hardware bring-up, fault analysis, diagnostics, and validation in lab environments.
- Established habit of building with AI coding agents and designing work to be agent-amenable while keeping humans in the loop.
Nice-to-have:
- Experience building Rust system software at hyperscaler scale or at Rust-native hardware companies.
- Distributed systems experience: gRPC/RPC frameworks, coordination and telemetry patterns; inference serving stacks (vLLM or similar) a plus.
- Experience shipping and supporting customer-facing SDKs and their documentation while maintaining ABI compatibility.
- Production readiness: CI/CD and release workflows, monitoring and alerting, Kubernetes, and data center operational workflows.
Education Requirements
MS in Computer Science, Computer Engineering, Electrical Engineering, or a related STEM field, or equivalent practical experience.
About the Company
Company: NVIDIA
Headquarters: Santa Clara, California, USA
NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.

Date Posted: 2026-06-30