Meta Platforms logo

Hardware Systems Engineer, NPI AI

Meta Platforms
July 02, 2026
Full-time
On-site
Menlo Park, California, United States
$144,000 - $204,000 USD yearly
Test Engineering Jobs, Level - Mid-Career

Job Title

Hardware Systems Engineer, NPI AI

Role Summary

Support new product introduction (NPI) for next-generation AI and high-performance computing infrastructure used in large-scale data centers. Partner with hardware design, firmware, software, networking, and capacity engineering teams to validate, debug, and scale AI server systems from early bring-up through production readiness.

Experience Level

Mid-level (recommended 6+ years of relevant experience in hardware systems or system-level bring-up and validation).

Responsibilities

Lead validation and bring-up activities across AI/HPC server platforms and components, drive defect resolution, and define deployment readiness criteria.

  • Define and lead end-to-end system validation strategies for AI accelerators, GPU clusters, and related subsystems.
  • Perform hands-on bring-up, characterization, and validation of AI server systems and components (PCIe, NVLink, DRAM, high-speed networking fabrics).
  • Develop and maintain test specifications, validation procedures, and debug guides for NPI programs.
  • Investigate and root-cause complex failures spanning silicon, firmware, software, and hardware layers.
  • Triage and track hardware and firmware defects to resolution while meeting NPI milestones.
  • Identify gaps in test coverage and improve test methodologies, tooling, and automation frameworks.
  • Partner with platform and capacity teams to define acceptance criteria and deployment readiness standards.
  • Guide data collection and analysis to surface systemic hardware quality trends and support go/no-go decisions.
  • Communicate validation status, risks, and technical findings to internal teams and external vendors.

Requirements

Must-have technical skills and domain experience required to perform the role.

Must-have:

  • 6+ years experience in hardware systems engineering, silicon validation, firmware validation, or system-level bring-up for AI servers, GPUs, TPUs, or AI accelerators.
  • Experience in one or more: ASIC bring-up/characterization, board-level debug, firmware validation, or large-scale system validation in data center environments.
  • Experience developing test specifications, validation procedures, and debug methodologies for complex hardware systems.
  • Proven ability to lead root-cause analysis and troubleshoot system-level failures across hardware, firmware, and software stacks.
  • Experience with high-speed interconnects or memory subsystems such as PCIe, NVLink, DDR5, or HBM in AI/HPC validation contexts.

Nice-to-have:

  • Experience with SoC debugging tools (JTAG, GDB, Trace32) and common bus protocols (I2C, SPI, USB, PCIe).
  • Experience defining hardware-software interfaces for telemetry, diagnostics, and out-of-band management.
  • Experience integrating lab instrumentation and automation frameworks for large-scale NPI validation.
  • Proficiency in Linux and server system management tools used in data center operations.

Education Requirements

Bachelor's degree in Computer Science, Computer Engineering, or a relevant technical field, or equivalent practical experience.


About the Company

Company: Meta Platforms

Headquarters: Menlo Park, California, United States

American technology company that develops social networking products (Facebook, Instagram, WhatsApp) and invests in virtual/augmented reality hardware and software through Reality Labs, focusing on connectivity, advertising, and immersive computing experiences.

Meta Platforms logo

Date Posted: 2026-07-02