Job Title
Senior Debug System Engineer, Datacenter
Role Summary
Join NVIDIA's Datacenter product engineering team within Operations as a Senior System Debug Engineer focused on failure analysis and debug during New Product Introduction (NPI). You will work on DGX, MGX, and HGX server products to ensure reliable transfer from development to mass production.
Collaborate with internal teams, vendors, suppliers, and factory personnel to identify root causes, deliver corrective actions, and enable manufacturing and test readiness.
Experience Level
Senior: typically 12+ years of relevant experience.
Responsibilities
Primary duties include failure analysis, debugging, and enabling manufacturing readiness for datacenter GPU systems.
- Perform failure analysis on GPU baseboards and servers at component, system, and rack levels (L6-L11/rack).
- Analyze HW, SW, and FW logs and failures; propose debug and mitigation strategies.
- Design experiments, collect and analyze data to determine root cause.
- Drive DFx/Build-for-Test and manufacturing enabling activities to meet schedule.
- Produce clear failure reports, root cause analyses, and corrective action plans.
- Develop debug guides and transfer knowledge to partner teams.
- Coordinate with vendors, suppliers, internal engineers, and factory personnel globally.
- Travel to factory sites as required to support debug and validation.
Requirements
Key qualifications and skills required for the role.
- 12+ years of professional experience in failure analysis, system debug, or related roles.
- Proven failure analysis/debug experience on motherboards, graphics cards, servers, PCs, or datacenter products.
- Experience enabling DFx and manufacturing/test readiness.
- Strong skills in one or more areas: hardware, software, components, process, test, validation.
- Familiarity with lab characterization equipment such as oscilloscopes and protocol analyzers.
- Strong problem-solving, organization, negotiation, and time-management skills.
- Effective written and verbal communication; ability to work independently and in teams.
- Willingness and ability to travel to factory sites as needed.
Education Requirements
Bachelor's or Master's degree in Electrical Engineering or a related field, or equivalent practical experience.
About the Company
Company: NVIDIA
Headquarters: Santa Clara, California, USA
NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.

Date Posted: 2026-05-20