Deep Learning Compiler Engineer - CUDA

NVIDIA

May 29, 2026

Full-time

On-site

Shanghai, China

Other Semiconductor Jobs, Level - Entry or Early Career

Job Title

Deep Learning Compiler Engineer - CUDA

Role Summary

Join the NVIDIA Architecture group to design and implement the DSL and core compiler for a tile-aware GPU programming model targeting emerging GPU architectures. The role focuses on compiler architecture, performance optimization, and integration with AI/ML frameworks for high-performance and LLM workloads.

Experience Level

Entry-level; 2+ years of relevant work experience.

Responsibilities

Primary responsibilities include:

Design and implement the DSL and core compiler for a tile-aware GPU programming model.
Optimize compiler architecture to improve performance on GPU hardware.
Investigate next-generation GPU architectures and propose compiler-level solutions.
Perform performance analysis on AI/LLM workloads and integrate optimizations with AI/ML frameworks.

Requirements

Key technical qualifications and skills; degree requirements are listed separately below.

Must-have: Strong C/C++ programming and software engineering skills; solid fundamentals in computer architecture; strong problem abstraction and problem-solving ability; 2+ years of relevant experience.
Preferred / Nice-to-have: Compiler experience (MLIR, TVM, Triton, LLVM); GPU architecture knowledge and high-performance kernel programming; familiarity with LLM algorithms or HPC domains; multi-GPU distributed communication experience; ACM background; strong English oral communication.

Education Requirements

Master's or PhD (or equivalent practical experience) in Computer Engineering, Computer Science & Engineering, Computer Science, Artificial Intelligence, or a related technical discipline.

About the Company

Company: NVIDIA

Headquarters: Santa Clara, California, USA

NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.

Date Posted: 2026-05-29

Apply now

Deep Learning Compiler Engineer - CUDA

Job Title

Role Summary

Experience Level

Responsibilities

Requirements

Education Requirements

About the Company

More jobs

Graduate Silicon Design Engineer

Advanced Micro Devices

Graduate Silicon Design Engineer

Advanced Micro Devices