NVIDIA logo

Principal Systems Software Engineer, LPU

NVIDIA
July 01, 2026
Full-time
On-site
Santa Clara, California, United States
$272,000 - $431,250 USD yearly
Other Semiconductor Jobs, Level - Senior

Job Title

Principal Systems Software Engineer, LPU

Role Summary

Lead technical direction for LPX system software: hardware abstraction layers, core libraries, drivers, and runtimes implemented in Rust. Collaborate with hardware, compiler, and operations teams to design APIs, drive bring-up, and improve system reliability.

The team builds memory-safe, long-lived platform software that bridges novel silicon and higher-level workloads.

Experience Level

Senior β€” 12+ years building production system software.

Responsibilities

Primary responsibilities include architecture, implementation, and leadership across low-level system software:

  • Define architecture and own API contracts for HALs and core system libraries.
  • Design and implement drivers, runtimes, and data-movement pipelines for novel silicon.
  • Build runtime interfaces for launching, monitoring, and managing production-scale workloads.
  • Lead triage of complex sequencing, initialization, and cross-component runtime failures; produce root-cause analyses that change system design.
  • Lead hardware bring-up and NPI in partnership with hardware, compiler, and data center teams.
  • Establish agent-assisted engineering practices, reusable abstractions, diagnostics, and documentation to raise team throughput.
  • Communicate architecture and design tradeoffs clearly to engineering and executive audiences.

Requirements

Must-have technical skills and experience:

  • 12+ years building production system software.
  • Deep systems-programming expertise with Rust as the primary language for low-level work; production experience shipping Rust at the hardware or kernel boundary (drivers, firmware, runtimes).
  • Track record designing and evolving long-lived libraries and APIs with ABI and compatibility discipline.
  • Experience with large, multi-repository codebases and layered dependencies.
  • Demonstrated leadership driving triage of difficult reliability issues to clear, written root-cause analysis.
  • Low-level platform experience: firmware and boot flows, RTOS, BMCs/MCUs, RISC-V, or closely related system software.
  • Linux driver or kernel-adjacent experience (for example, VFIO or similar subsystems).
  • Hardware bring-up, fault analysis, diagnostics, and validation in lab environments.
  • Established habit of building with AI coding agents and designing work to be agent-amenable while keeping humans in the loop.

Nice-to-have:

  • Experience building Rust system software at hyperscaler scale or at Rust-native hardware companies.
  • Distributed systems experience: gRPC/RPC frameworks, coordination and telemetry patterns; inference serving stacks (vLLM or similar) a plus.
  • Experience shipping and supporting customer-facing SDKs, documentation, and ABI compatibility practices.
  • Production readiness: CI/CD and release workflows, monitoring and alerting, Kubernetes, and data center operational workflows.

Education Requirements

MS in Computer Science, Computer Engineering, Electrical Engineering, or a related STEM field, or equivalent practical experience.


About the Company

Company: NVIDIA

Headquarters: Santa Clara, California, USA

NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.

NVIDIA logo

Date Posted: 2026-06-30