Senior Quantization Engineer - Edge AI
Senior engineer responsible for research and engineering to optimize neural networks for on-device deployment on NXP Ara2 NPUs. Primary focus is model quantization, with work also covering speculative decoding, pruning, and other efficiency techniques for CNNs, LLMs, and VLMs.
The role spans prototype research, hardware-aware adaptation, and production implementation, working across AI research, hardware engineering, and software teams.
Senior-level. Specific years of experience not stated.
Key responsibilities center on translating state-of-the-art model optimization research into efficient production implementations and deployment recipes for embedded NPUs.
Must-have technical skills and practical experience for successful performance in the role.
The listing cites an MSc or Ph.D. (noted as a plus) in Computer Science, Electrical Engineering, or Mathematics with a focus on Machine Learning or Deep Learning. No other degrees, certifications, or explicit "equivalent experience" language was provided.
Company: NXP Semiconductors
Headquarters: Nijmegen, Netherlands
NXP Semiconductors N.V. is a global semiconductor company that provides High Performance Mixed Signal and Standard Product solutions. With over 45,000 employees and operations in more than 35 countries, NXP is a leader in secure connectivity solutions for embedded applications, catering to automotive, industrial IoT, mobile, and communication infrastructure markets. The company is committed to innovation and sustainability, advancing a smarter, safer, and more sustainable world through technology.
