Senior Deep Learning Hardware Modeling Architect - LPU
Join NVIDIA's team working on hardware-software co-design for large language model (LLM) inference. The role focuses on modeling critical hardware components, producing executable C++ models, and driving component and system architectural specifications to improve inference performance and efficiency.
Senior level. Requires 5+ years of relevant experience.
Key responsibilities include specification, modeling, and performance engineering of hardware components and systems.
Must-have technical skills and experience.
Nice-to-have:
BS or higher in Computer Science, Electrical Engineering, Mathematics, or a related technical field, or equivalent practical experience.
Company: NVIDIA
Headquarters: Santa Clara, California, USA
NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.
