Deep Learning Performance Architect
Join a deep learning performance team to model, measure, and optimize ML/DL workloads on GPU and accelerator-based systems. The role focuses on performance benchmarking, building and validating performance models, identifying bottlenecks, and proposing architecture and software optimizations for LLM/Generative AI workloads.
Mid-level. The posting indicates that 3+ years of relevant experience is a plus but not strictly required.
Primary responsibilities include analyzing workload performance, creating projections, and driving actionable improvements across hardware and software.
Core qualifications and technical skills required or strongly preferred.
BSc, MS, or PhD in Computer Science, Electrical Engineering, Mathematics, or a related technical discipline.
Company: NVIDIA
Headquarters: Santa Clara, California, USA
NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.
