NVIDIA

Senior System Software Engineer, Performance - CUDA Driver

NVIDIA · Santa Clara, California, US
Full-time · Senior (7-15 yrs) · Posted 20d ago
Software Engineering · IC3 · IC + Management · Hybrid (3d)
Stack: C, CUDA, Multithreaded programming, Operating systems, Linux, Windows, macOS, Device drivers, GPU architecture, x86, ARM, POWER, Memory management, Virtual memory, Performance profiling, Debugging, System software, Kernel mode drivers, Memory-mapped IO, Memory coherence models, Interconnects

Summary

Senior-level systems software engineering role at NVIDIA focused on performance analysis, optimization, and feature development for the CUDA driver and runtime platform across GPU, OS, and hardware layers.

About the role

We are hiring senior engineers to work on the CUDA driver and runtime, core components of our platform for accelerating general purpose computation on the GPU. Our team analyzes performance of applications, investigates bottlenecks in software or hardware and delivers features and improvements to better realize the potential of NVIDIA hardware for a growing range of computational workloads, ranging from deep learning, scientific computation, and self-driving cars to video games and virtual reality.

CUDA defines a unified programming model across a range of system configurations and hardware capabilities. To accomplish this, the CUDA driver interacts with GPU hardware, kernel mode drivers, and the operating system.

What you'll be doing:

As a member of our team, you will use your design abilities, coding expertise, and creativity to deliver the best compute platform in the world. You will craft elegant solutions to exciting problems and help shape the future direction of CUDA as you collaborate with your peers across NVIDIA. You will investigate complex performance problems and deliver robust solutions that accelerate applications.

  • Evangelize, architect, and implement new features

  • Oversee and drive development efforts across multiple teams

  • Analyze full stack performance ranging from application level through libraries, system software, kernel software and hardware

  • Define forward-looking improvements to the CUDA APIs and programming model

  • Create novel system software optimizations

  • Write effective, maintainable, and well-tested code

  • Develop code for multiple operating systems

What we need to see:

  • BS or MS degree in Computer Science or Electrical Engineering (or equivalent experience)

  • 7+ years of related development experience

  • Strong C programming skills

  • Experience working with large codebases

  • Track record of debugging performance problems in complex environments with software and hardware components

  • Experience with operating system interfaces for threads, process control, and virtual memory

  • Experience writing and debugging multithreaded programs

  • Deep understanding of technology and passion for what you do

  • Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment. Good written communication.

Ways to stand out from the crowd:

  • Understanding of system level architecture, such as interconnects, memory hierarchy, interrupts, and memory-mapped IO

  • Experience with performance tuning of device drivers or low level system software

  • Experience with performance optimizations across a variety of CPU architectures, such as x86, POWER, and ARM

  • Knowledge of memory coherence and consistency models

  • Experience with Windows, Linux, or macOS driver development

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and hardworking people in the world working for us. If you're creative and autonomous, and are interested in working on novel problems in a collaborative environment, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 30, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

What you'll do

1. Evangelize, architect, and implement new CUDA features
2. Oversee and drive development efforts across multiple teams
3. Analyze full-stack performance from application level through libraries, system software, kernel software, and hardware
4. Define forward-looking improvements to CUDA APIs and programming model
5. Create novel system software optimizations
6. Write effective, maintainable, and well-tested code
7. Develop code for multiple operating systems
8. Investigate complex performance problems and deliver robust solutions

Requirements

7+ years of systems software development with strong C programming expertise in large-scale codebases
Proven track record debugging and resolving performance bottlenecks across complex software and hardware environments
Deep experience with OS interfaces including threads, process control, and virtual memory
Experience writing and debugging multithreaded programs in systems-level contexts
Ability to architect and drive development efforts across multiple teams in a collaborative matrix environment

Nice to have

System-level architecture knowledge (interconnects, memory hierarchy, interrupts, MMIO)
Driver or low-level system software performance tuning
Multi-architecture CPU optimization (x86, POWER, ARM)
Memory coherence and consistency model expertise
Windows, Linux, or macOS driver development experience

Role overview

Role family: Software Engineering
Level: IC3 (platform)
Experience: 7–15 years
Type: Hybrid (IC + Management)
Remote policy: Hybrid (3 days)
Visa sponsorship: Not offered

Tech stack analysis

LANGUAGES: C
FRAMEWORKS: CUDA
INFRASTRUCTURE: Linux, Windows, macOS, GPU hardware, Kernel mode drivers
TOOLS: Performance profiling tools, Debugging tools

Green flags

6 items
Compensation: Salary disclosed with two-level banding (L4 $184K–$287.5K and L5 $224K–$356.5K) plus equity; highly transparent and top-of-market for systems engineers.

Hiring insights

JD quality: 9/10
Urgency: medium
Autonomy: high
Team size: medium (5-15)

Red flags

2 items
Compensation: Benefits package beyond equity is not detailed. There is no mention of health insurance, PTO, parental leave, or other perks, leaving candidates to assume.

Interview insights

Rounds: 5
Duration: 4 wks
Difficulty: very hard
Take-home: No

Career path

Next roles: Staff System Software Engineer, Principal Engineer – CUDA Platform, Engineering Manager – Systems Software

About the company

NVIDIA is the world's leading designer of GPUs and AI computing platforms. Its chips power everything from gaming and data centers to autonomous vehicles and scientific research. With a market cap exceeding $2 trillion, NVIDIA's CUDA platform and AI accelerators have become the backbone of the global AI revolution.

HQ: Santa Clara, CA, USA
Interview difficulty: very hard
Build vs Maintain: both
Cross-functional: Yes