Job Title
Principal Silicon Failure Analysis Engineer
Role Summary
Lead the Silicon Failure Analysis (SiFA) lab infrastructure to ensure a safe, highly available, and scalable environment that enables Fault Isolation, Physical Failure Analysis, and Supplier Quality Engineering teams to root-cause advanced semiconductor issues.
This role owns day-to-day lab operations, facilities and utilities coordination, tool enablement, vendor partnerships, safety and compliance, inventory and asset management, and multi-year infrastructure roadmaps.
Experience Level
Senior level β requires extensive leadership and technical experience. The posting specifies 15+ years in semiconductor, R&D, or high-precision lab infrastructure.
Responsibilities
Primary accountabilities for maintaining and scaling SiFA lab operations:
- Lead overall SiFA lab infrastructure and ensure operational readiness, availability, and reliability for FI, PFA, and SQE teams.
- Own day-to-day lab operations and incident resolution; drive MTBF/MTTR and uptime improvements.
- Manage facilities and utilities (power, backup, cooling water, DI/PCW, exhaust, vacuum, CDA, nitrogen, specialty gases) and coordinate upgrades, maintenance, and outages.
- Enable and sustain failure analysis tools from delivery through operation; implement preventive maintenance and reliability metrics.
- Lead vendor relationships and cross-functional coordination with Facilities, EHS, IT, Finance, Procurement, and equipment suppliers.
- Maintain consumables, inventory, and asset lifecycle tracking, including gases, chemicals, PPE, and materials.
- Enforce safety, chemical, ESD, regulatory governance, access control, training, and IP protection requirements.
- Define and execute long-term SiFA lab infrastructure roadmaps and phased expansions to support future nodes and advanced packaging.
Requirements
Core skills and experience required and desirable:
-
Must-have: 15+ years of experience in semiconductor, R&D, or high-precision lab infrastructure with demonstrable ownership of capital equipment enablement and facilities coordination.
-
Must-have: Proven vendor management and cross-functional leadership experience; strong communication and execution skills.
-
Must-have: Experience improving uptime, availability, and maintenance processes for complex capital tools (metrics-driven reliability improvements).
-
Nice-to-have: Prior end-to-end ownership of high-availability failure analysis labs resolving yield, performance, reliability, or quality issues.
-
Nice-to-have: Track record delivering multi-year scaling roadmaps and rigorous safety/compliance governance in lab environments.
Education Requirements
Bachelor's degree or higher in Engineering or a related technical field, or equivalent practical experience.
About the Company
Company: NVIDIA
Headquarters: Santa Clara, California, USA
NVIDIA is a global leader in accelerated computing, renowned for its innovative solutions in AI and digital twins that transform diverse industries. The company specializes in networking technologies, providing end-to-end InfiniBand and Ethernet solutions for servers and storage that optimize performance and scalability. NVIDIA serves sectors such as high-performance computing, enterprise data centers, and cloud computing, constantly reinventing its products and services to stay ahead in the market.

Date Posted: 2026-05-20