NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
As part of this role, you will be working on a powerful private cloud system that supports various teams across NVIDIA. Imagine scaling cloud services to run on thousands of servers and handling millions of automated jobs every day, improving the efficiency of NVIDIA's software engineers worldwide. We collaborate with teams such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence, Autonomous Vehicles, and Omniverse!
What you’ll be doing:
Craft creative scalable cloud solutions for running millions of jobs, thousands of systems, and petabytes of storage.
Address exciting challenges in infrastructure such as Kubernetes, job scheduling, multi-region services, resource management, and automated recovery.
Develop agentic workflows for infrastructure (e.g., self-healing pipelines, automated resource scaling).
Collaborate with customers to understand their needs and develop innovative solutions that cater to their requirements.
What we need to see:
Proven experience in developing scalable cloud infrastructure solutions from concept to production.
Background in AI/ML, Data Analytics, and their application in infrastructure.
Strong background in object-oriented programming, with a preference for Java or Go.
Ability to collaborate optimally across multiple teams and different time zones.
Bachelor's degree or equivalent experience.
10+ years of experience in infrastructure development.
Ways to stand out from the crowd:
Experience in crafting, implementing, and deploying major infrastructure features.
Expertise in crafting and scaling microservices and deploying them on Kubernetes clusters.
Built robust distributed systems for heterogeneous platforms.
Experience working in infrastructure or on hardware systems
Come and work with us at NVIDIA where we have the most resourceful and dedicated people in the world to advance Artificial Intelligence. If you are passionate about infrastructure, we'd love to hear from you!