NVIDIA

Senior Systems Engineer - Software

US, CA, Santa Clara Full time

It's an exciting time to join the NVIDIA Cloud Native Engineering (NVCNE) group’s backend software team! As a Cloud Platform Software Engineer, you will work alongside architects, designers, frontend engineers, SREs and others to deliver a software platform that supports the lifecycle of Artificial Intelligence (AI) super compute infrastructure on Kubernetes. Together we will enable AI services across the cloud.
 

The engineer will write software that aligns with the broad architectural vision for the NVIDIA Cloud Platform, working with architects to develop a robust, scalable system. The engineer owns their code - from development to commit to test to production. They will support SRE teams with development work and collaborate with internal product teams on sophisticated distributed systems problems at scale. This engineer will be encouraged to foster NVIDIA’s approach to Cloud Native development, especially on Kubernetes.

What you will be doing:

  • Develop software systems to support large-scale deployments of cloud infrastructure

  • Design, develop and distribute APIs to support Infrastructure as Code (IaC) automation and deployment workflows

  • Contribute to multiple source code projects to fulfill NVIDIA requirements with software services

  • Work and collaborate with engineering managers, architects, designers, and frontend engineers to deliver high quality software

  • Automate the validation of software solutions with unit and integration tests

  • Innovate with other engineers on proposed designs and product direction

  • Openly share successes and failures in a no-blame environment

What we need to see:

  • BS in Computer Science, Information Systems, or Computer Engineering (or equivalent experience) and at least 12 years of overall experience

  • 5-7 years of proven experience in large-scale software development

  • Experience building and delivering services on Kubernetes

  • Proficiency with cloud-native infrastructure (AWS, GCP, Azure, OCI).

  • Experience collaborating with teams to write software that supports cloud services at scale

  • Ability to troubleshoot issues across multiple layers: infrastructure, Kubernetes, application runtime.

  • Strong proficiency in Golang for building Kubernetes operators, controllers, and custom tooling.

  • Experience designing and managing Kubernetes Custom Resource Definitions (CRDs).

  • Knowledge of managed Kubernetes services and scaling strategies across cloud and on-prem environments.

  • Experience developing auto-scaling infrastructure components, as well as with incident response and root cause analysis.

Ways to stand out from the crowd:

  • Experience with Kubernetes Cluster API, Terraform, CSP API and other infrastructure tooling

  • Background with using and contributing to open-source projects

  • Solid experience with Kustomize or other Kubernetes packaging tools.

  • Capable of refactoring software to run in systems such as Kubernetes

  • Ability to discuss and work with CSI, CNI, and CRI as well as familiarity with the CNCF and the tooling across the ecosystem

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're a creative, curious, and driven technical leader, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 4, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.