
Deep Learning Performance Architect
Summary
NVIDIA is hiring Software Engineers and Senior Software Engineers to develop GPU-accelerated deep learning inference software, including highly optimized kernels and performance tuning for TensorRT, working cross-functionally with automotive, image, and speech understanding teams.
About the role
We are expanding our research and development for Inference. We seek excellent Software Engineers and Senior Software Engineers to join our team.
We specialize in developing GPU-accelerated Deep learning software. Researchers around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in numerous areas. Join the team that builds software to enable new solutions. Collaborate with the deep learning community to implement the latest algorithms for public release in Tensor-RT. Your ability to work in a fast-paced customer-oriented team is required and excellent communication skills are necessary.
What you’ll be doing:
Develop highly optimized deep learning kernels for inference
Do performance optimization, analysis, and tuning
Work with cross-collaborative teams across automotive, image understanding, and speech understanding to develop innovative solutions
Occasionally travel to conferences and customers for technical consultation and training
What we need to see:
Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI)
SW Agile skills helpful
Excellent C/C++ programming and software design skills
Python experience a plus
Performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU
GPU programming experience (CUDA or OpenCL) desired
4 years of relevant work experience
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people on the planet working for us. If you're creative and autonomous, we want to hear from you!
What you'll do
Requirements
Nice to have
Role overview
Tech stack analysis
Salary estimate
NVIDIA is a top-tier semiconductor/AI company in Santa Clara, CA. Senior SWE roles at NVIDIA with GPU/ML specialization typically command $175K–$260K base salary, with significant additional stock compensation (RSUs) and bonuses. The 4+ years experience requirement and MS/PhD preference push this toward the senior band. Comparable NVIDIA roles on Levels.fyi show total compensation well above $300K when including equity.
See the AI-estimated salary range for this role
Sign up free →Green flags
4 itemsDiscover all 4 green flags for this role
Sign up free →Benefits breakdown
See all benefits organized by category — health, financial, time off & more
Sign up free →Hiring insights
See JD quality score, hiring urgency & team details
Sign up free →Red flags
PRO4 itemsSee all 4 red flags — what the JD isn't telling you
Sign up free →Interview insights
PROGet full interview breakdown — rounds, likely topics & prep tips
Sign up free →Career path
PROSee where this role leads — full career progression
Sign up free →NVIDIA is the world's leading designer of GPUs and AI computing platforms. Its chips power everything from gaming and data centers to autonomous vehicles and scientific research. With a market cap exceeding $2 trillion, NVIDIA's CUDA platform and AI accelerators have become the backbone of the global AI revolution.