Senior Software Engineer, CUTLASS Platform

NVIDIA
June 02, 2026
Full-time
On-site
Santa Clara, California, United States
$152,000 - $287,500 USD yearly
EDA Jobs, Level - Senior

Role Summary

Join the CUTLASS team to design and implement low-level GPU software components and compiler support that enable high-performance linear algebra and Tensor Core primitives on NVIDIA GPUs. The role focuses on CUDA C++, the CUTLASS Python DSL, and MLIR-based compiler work to produce efficient kernels and demonstrate novel hardware features.

Experience Level

Senior. Typical expectation: 3+ years of relevant industry experience.

Responsibilities

The role combines systems programming, compiler design, and collaboration with GPU architecture and compiler teams. Key responsibilities include:

  • Develop core CUTLASS platform components: Tensor Core MMAs, copies, synchronization primitives, schedulers, and related CUDA C++ code and Python DSL constructs.
  • Design MLIR dialects and compiler passes for the CUTLASS Python DSL and advance the MLIR-based backend.
  • Author example kernels and reference implementations that exercise CUTLASS abstractions and new GPU features to validate performance.
  • Work with GPU architecture, CUDA, and compiler teams to evaluate programming models and assess performance of current and future hardware features.
  • Debug, profile, and test performance-critical code; contribute to performance evaluations and benchmarking.

Requirements

Must-have technical skills and experience:

  • 3+ years of relevant industry experience in systems, compilers, or high-performance software development.
  • Strong C++ programming and software design skills, including debugging, performance evaluation, and testing.
  • Experience with high-performance code generation and knowledge of compiler transformations and optimizations.
  • Deep understanding of computer architecture and parallel computing programming models.

Nice-to-have:

  • Experience writing low-level high-performance kernels (NVVM/PTX or equivalent).
  • Hands-on compiler design experience, particularly with MLIR.
  • Familiarity with deep learning models, algorithms, and frameworks.

Education Requirements

Master's or PhD in Computer Science, Computer Engineering, or a related field, or equivalent practical experience.


About the Company

Company: NVIDIA

Headquarters: Santa Clara, California, USA

NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.

Date Posted: 2026-06-03