Job Title
Principal AI Performance Engineer
Role Summary
Work with customers and internal engineering teams to optimize the inference performance and power efficiency of production AI models on Arm technology. Deliver implementations spanning kernel level to system level, along with performance analysis and recommendations that influence IP and tooling roadmaps.
This is a San Jose-based, customer-facing role with a hybrid work pattern and frequent collaboration across the Bay Area. Candidates must have the right to work in the United States without sponsorship.
Experience Level
Senior / Principal level. Years of experience not specified.
Responsibilities
Key responsibilities include:
- Develop highly optimized kernel- and system-level solutions for AI workloads to meet customer application requirements.
- Create production-quality reference implementations, documentation, and performance-focused technical content.
- Act as the technical interface between customers and internal teams to diagnose and resolve complex performance issues.
- Use customer insights to influence Arm IP and software tooling roadmaps.
Requirements
Must-have technical skills and experience:
- Experience optimizing deep neural networks (DNNs) using Triton, CUDA, or other kernel-level programming approaches.
- Deep understanding of parallel computing, memory hierarchies, and performance optimization techniques for DNNs.
- Strong programming skills in Python and C++, plus familiarity with modern AI frameworks and execution models.
- Experience with profiling and analysis tools for performance debugging.
- Strong communication and interpersonal skills for customer engagement and cross-team collaboration.
Nice-to-have:
- Experience in a customer-facing or field engineering environment.
- Experience within the Arm ecosystem.
- Background in AI performance optimization for edge devices.
Education Requirements
Not specified.
About the Company
Company: Arm
Headquarters: Cambridge, United Kingdom
Arm is a global leader in semiconductor and software design, driving innovation in computing technology. The company designs processors and systems that provide the essential building blocks for electronic devices. Arm's architecture is widely used in smartphones, servers, and IoT devices. Its collaborative culture encourages bold thinking and diversity, and the company offers high-impact benefits to its workforce.

Date Posted: 2026-06-22