NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We build solutions to optimize all layers of the CUDA ecosystem, leading to class-leading speedups in modern high-performance workloads and models. We are looking for an outstanding Senior Software Engineer that can architect and implement these highly scalable solutions to different use-cases. As a member of the team, you will develop new innovative workflows, work on compiler- or runtime-driven solutions that accelerate critical workloads, generate optimal code patterns at scale, and other high-impact AI challenges. You will collaborate closely with internal NVIDIA software and hardware teams to push the latest developments into NVIDIA products.
What you'll be doing:
Design and build high-performance optimization frameworks for the entire CUDA ecosystem.
Co-design novel solutions with software, hardware and algorithm teams; influence and adopt new capabilities as they become available.
Develop reproducible, high-fidelity evaluation frameworks covering performance, quality and developer productivity.
Collaborate across the AI stack — from hardware through compilers/toolchains, kernels/libraries, frameworks, distributed training, and inference/serving.
What we need to see:
Bachelor’s degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); MS or PhD preferred.
6+ years of industry or academia experience with software engineering, compilers and developer tools; exposure to building comprehensive optimization frameworks, and hands-on experience with product environments.
Strong knowledge of compilers, code generation, and GPU architecture.
Experience with GPU programming and performance optimization (CUDA or equivalent).
Extensive Python programming skills, along with software engineering fundamentals. Basic programming skills in other languages such as C/C++, Racket and Rust.
Strong mathematical and scientific foundation relevant to optimization heuristics/algorithms, ML and data science.
Track record developing and productizing software, optimization frameworks and/or developer tooling.
Ways To Stand Out From The Crowd:
Familiarity with genetic/evolutionary algorithms, predictive modeling, and complex systems.
Deep expertise in GPU performance optimizations, evidenced by benchmark wins or published results.
Hands-on experience building compilers or compiler components using the LLVM framework, including optimization passes and code generation.
Familiarity with NVIDIA and open source compilers like LLVM, MLIR, PTX and OpenAI Triton.
Experience with Data Science projects, specifically with MLOPS workflows and tools, like W&B, MLflow, etc.
With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.You will also be eligible for equity and benefits.