NVIDIA is searching for an outstanding researcher to join our Large Language Model (LLM) research team. We are passionate about research that pushes boundaries and also has impact in the real world. We are particularly excited about methods for post-training and alignment, principled approaches to synthetic data generation and filtering, advanced reasoning and inference algorithms for LLMs, novel learning paradigms and LLM architectures, and scientific understanding of the fundamental limits and capabilities of LLMs. You will work within an amazing and collaborative research team that consistently publishes at the top venues in machine learning and natural language processing. Our existing expertise includes deep learning, NLP, and computer vision. Your contributions have the chance to create real impact on our products.

What you'll be doing:

Explore alternative avenues to unlock new capabilities in language models, including advanced knowledge acquisition techniques and innovative learning and decoding algorithms.
Develop new learning paradigms that incorporate agency into the training of language models, such as enabling self-reflection and targeted knowledge enhancement.
Enable learning from multiple modalities beyond written text, such as acquiring physical commonsense knowledge through interactions with real-world environments.
Publish original research.
Collaborate with other team members and teams.
Mentor interns.
Speak at conferences and events.
Work with product groups to transfer technology.
Collaborate with external researchers.

What we need to see:

PhD in Computer Science or Computer Engineering (or equivalent experience).
At least 6 years of research experience (demonstrated by a publication record spanning 5+ years) in artificial intelligence, machine learning, natural language processing, computer vision, or related subjects.
A history of research success exemplified by a strong publication record and awards.
Excellent knowledge of theory and practice of deep learning and natural language processing.
Background in LLM training, alignment, and evaluation is expected.
Excellent programming skills in Python and PyTorch.
Hands-on experience with large-scale model training including data preparation and model parallelization (tensor and pipeline) is required.
Excellent communication skills.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and productive people in the world working for us. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 26, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
