Senior AI Training Performance Engineer
Analyze and optimize AI training performance across the hardware and software stack, focusing on GPU-based systems and deep learning frameworks. Join the Deep Learning Architecture team to influence hardware and software roadmaps that improve training throughput and efficiency.
Senior: expects senior-level experience. Typical guidance: PhD (or equivalent experience) with 5+ years, or MS with 8+ years of relevant experience.
Primary duties include profiling, diagnosing, and improving training workloads; implementing production-quality code; and building tools and simulations to inform architecture decisions.
Must-have technical skills and experience:
PhD in Computer Science, Electrical Engineering, or CSEE (or equivalent practical experience) with 5+ years of relevant work experience, or MS with 8+ years of relevant work experience.
Company: NVIDIA
Headquarters: Santa Clara, California, USA
NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.
