Deep Learning Compiler Engineer - CUDA
Join the NVIDIA Architecture group to design and implement the DSL and core compiler for a tile-aware GPU programming model targeting emerging GPU architectures. The role focuses on compiler architecture, performance optimization, and integration with AI/ML frameworks for high-performance and LLM workloads.
Experience level: entry-level, with 2+ years of relevant work experience.
Primary responsibilities include:
Key technical qualifications and skills; degree requirements are listed separately below.
Master's or PhD (or equivalent practical experience) in Computer Engineering, Computer Science & Engineering, Computer Science, Artificial Intelligence, or a related technical discipline.
Company: NVIDIA
Headquarters: Santa Clara, California, USA
NVIDIA is a global leader in accelerated computing, known for AI and digital-twin solutions used across many industries. Its networking business provides end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, and continually updates its products and services.
