NVIDIA

Senior Compiler Tools Engineer - CUDA Tile

US, CA, Santa Clara Full time

NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI, the next era of computing, with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”.

We are seeking dedicated Compiler Software Engineers for the CUDA Tile team. NVIDIA GPUs are at the center of the deep learning revolution and continue to enable breakthroughs in generative AI, large language models, recommendation systems, speech recognition, image classification and other areas. Come join us to work with a top-notch team and have broad impact across the entire deep learning community.

What you’ll be doing:

In this role, you will be working on CUDA Tile, a new tile-based programming model for our GPUs. CUDA Tile shipped with CUDA 13.1 and is a major addition to CUDA (https://developer.nvidia.com/cuda/tile).

  • You will design and implement compiler transformations, develop MLIR-based dialects and lowering passes, and optimize the performance of tile-based kernels to ensure they execute efficiently across multiple generations of NVIDIA GPU architectures.

  • The scope of these efforts includes defining public APIs and compiler interfaces, designing and implementing compiler optimization techniques, performance tuning and tooling, and general software engineering work.

What we need to see:

  • Bachelor's, Master's, or Ph.D. in Computer Science, Computer Engineering, or a related field (or equivalent experience).

  • 3+ years of relevant work or research experience in compiler optimization, performance analysis and IR design.

  • Ability to work independently, define project goals and scope, and lead your own development effort.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team.

Ways to stand out from the crowd:

  • Knowledge of CPU and/or GPU architecture. CUDA or OpenCL programming experience.

  • Experience with MLIR, LLVM, XLA, and TVM, and with deep learning models and algorithms.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 7, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.