NVIDIA

Vice President, Reliability

US, CA, Santa Clara

Full time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.

 

Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

 

We are looking for an experienced Reliability Leader to advance the NVIDIA reliability function. This leader will ensure reliability is built into products and processes, work to minimize failures over time, and collaborate with engineering, advanced technology, manufacturing, and operations teams. Duties include defining reliability targets, evolving test methodologies, managing lab investments, and driving continuous improvement across NVIDIA’s rapidly advancing portfolio.

 

This role requires a strategic approach, deep technical expertise in silicon and systems, and the ability to synthesize complex information while leading cross-functional teams to deliver highly reliable products at scale.

 

What you'll be doing:

  • Reliability Strategy, Planning, and Execution
    • Develop and refine the company’s reliability roadmap in partnership with key collaborators, ensuring alignment with business priorities and customer requirements.
    • Define, track, and report key reliability metrics, establishing mechanisms for ongoing evaluation and continuous improvement.
    • Embed reliability practices throughout the entire product life cycle.
    • Support the development and application of failure prediction modeling methodologies.
  • Prevent and Recover from Reliability Deviations
    • Lead root cause analysis efforts for incidents and recurring reliability issues.
    • Partner with R&D, manufacturing, and supply chain teams to proactively eliminate failure modes through design, process, and production improvements.
    • Establish and maintain reliability standards, procedures, and ongoing improvement processes.
    • Build strong relationships across engineering, operations, and field teams to drive rapid issue resolution and effective communication.
  • Multi-Functional Leadership
    • Lead teams through high-pressure situations by rapidly assimilating technical data, identifying critical issues, and guiding response strategies.
    • Define data requirements, including the collection, analysis, and reporting of reliability data, ensuring accuracy and relevance for decision-making.
    • Present data-driven insights and recommendations to inform strategy and operational improvements.
    • Evolve sophisticated reliability models, plans, staffing, test equipment, etc., to scale with NVIDIA’s fast-paced and diverse portfolio of products.
  • Resource and Capability Management
    • Attract, develop, and retain top talent.
    • Actively lead critical investments in test equipment, space, and power infrastructure to ensure delivery against milestones.

What we need to see:

  • Bachelor’s or Master’s degree in Electrical Engineering, Computer Science, or a related field, or equivalent experience.
  • 18+ years of experience in quality or reliability-focused roles, preferably within the semiconductor, systems, or data center industries.
  • 10+ years of leadership experience building high-performing teams and delivering results through influence.
  • Deep expertise in reliability engineering, quality assurance, and related methodologies, applied in a business-focused context.
  • Proven capability to establish and drive industry-leading reliability and quality standards.
  • Strong analytical and critical-thinking skills, evidenced through consistent success in translating data into actionable insights.
  • Outstanding written and verbal communication skills, including proficiency in presenting complex technical concepts clearly to diverse audiences.
  • Consistent track record of managing multiple priorities and driving successful outcomes in a dynamic, forward-thinking environment.
  • Strong ethical standards and a commitment to professionalism, integrity, and confidentiality.

Ways to stand out from the crowd:

  • Expertise in reliability and quality engineering in technology or manufacturing settings.
  • An advanced degree in engineering, reliability, or quality management.
  • Experience operating in a global environment, including leading cross-cultural teams and navigating international quality standards.

 

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer you and your family at www.nvidiabenefits.com/

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 388,000 USD - 557,750 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until January 20, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.