Job Title
Hardware Systems Engineer, NPI AI
Role Summary
Support new product introduction (NPI) for next-generation AI and high-performance computing infrastructure used in large-scale data centers. Partner with hardware design, firmware, software, networking, and capacity engineering teams to validate, debug, and scale AI server systems from early bring-up through production readiness.
Experience Level
Mid-level (recommended 6+ years of relevant experience in hardware systems or system-level bring-up and validation).
Responsibilities
Lead validation and bring-up activities across AI/HPC server platforms and components, drive defect resolution, and define deployment readiness criteria.
- Define and lead end-to-end system validation strategies for AI accelerators, GPU clusters, and related subsystems.
- Perform hands-on bring-up, characterization, and validation of AI server systems and components (PCIe, NVLink, DRAM, high-speed networking fabrics).
- Develop and maintain test specifications, validation procedures, and debug guides for NPI programs.
- Investigate and root-cause complex failures spanning silicon, firmware, software, and hardware layers.
- Triage and track hardware and firmware defects to resolution while meeting NPI milestones.
- Identify gaps in test coverage and improve test methodologies, tooling, and automation frameworks.
- Partner with platform and capacity teams to define acceptance criteria and deployment readiness standards.
- Guide data collection and analysis to surface systemic hardware quality trends and support go/no-go decisions.
- Communicate validation status, risks, and technical findings to internal teams and external vendors.
Requirements
Must-have technical skills and domain experience required to perform the role.
Must-have:
- 6+ years experience in hardware systems engineering, silicon validation, firmware validation, or system-level bring-up for AI servers, GPUs, TPUs, or AI accelerators.
- Experience in one or more: ASIC bring-up/characterization, board-level debug, firmware validation, or large-scale system validation in data center environments.
- Experience developing test specifications, validation procedures, and debug methodologies for complex hardware systems.
- Proven ability to lead root-cause analysis and troubleshoot system-level failures across hardware, firmware, and software stacks.
- Experience with high-speed interconnects or memory subsystems such as PCIe, NVLink, DDR5, or HBM in AI/HPC validation contexts.
Nice-to-have:
- Experience with SoC debugging tools (JTAG, GDB, Trace32) and common bus protocols (I2C, SPI, USB, PCIe).
- Experience defining hardware-software interfaces for telemetry, diagnostics, and out-of-band management.
- Experience integrating lab instrumentation and automation frameworks for large-scale NPI validation.
- Proficiency in Linux and server system management tools used in data center operations.
Education Requirements
Bachelor's degree in Computer Science, Computer Engineering, or a relevant technical field, or equivalent practical experience.
About the Company
Company: Meta Platforms
Headquarters: Menlo Park, California, United States
American technology company that develops social networking products (Facebook, Instagram, WhatsApp) and invests in virtual/augmented reality hardware and software through Reality Labs, focusing on connectivity, advertising, and immersive computing experiences.

Date Posted: 2026-07-02