Job Details:
Job Description:
We are looking for a dynamic and passionate hands-on senior contributor to join Intel's AI Group.
Day-to-day work involves contributing to open-source AI frameworks such as PyTorch and inference-serving frameworks like vLLM and SGLang.
The role includes designing, developing, and optimizing features for Intel's AI framework software stack for Intel's AI accelerators and next-generation GPUs.
Roles and Responsibilities include:
- Design and develop software features for AI frameworks, both hardware-agnostic and hardware-aware.
- Enhance and extend deep learning inference and training capabilities in the software stack.
- Analyze and architect state-of-the-art features across different frameworks and drive development across the full software stack.
- Identify optimization opportunities in the software stack to improve the performance of deep learning workloads.
- Participate in discussions with the open-source community, contribute to development, and upstream software enhancements.
Qualifications:
- B.Tech or M.S./M.Tech in CS, ECE, or related fields with 6–12 years of overall experience.
- Proficiency in implementing complex software in Python; intermediate knowledge of advanced C++ (C++14/17) and parallel programming.
- In-depth, hands-on experience with frameworks such as PyTorch, vLLM, and SGLang.
- Experience with advanced inference-serving features such as disaggregated serving, quantization, speculative decoding, and constrained decoding.
- Strong understanding of LLMs.
- Practical knowledge of deep learning models for image and video generation is desirable.
- Ability to debug complex issues in multi-layered software systems; understanding of software integration in large open-source frameworks.
- Strong understanding of computer architecture and HW-SW optimization techniques.
- Effective communication skills and experience working in cross-geo teams.
- Ability to perform performance analysis of code on both host and accelerators/GPUs using open-source and proprietary profilers.
- Understanding of the competitive landscape for technologies in this domain.
Preferred:
- Experience developing and integrating CUTLASS or Triton-based kernels for deep learning.
- Knowledge of compiler algorithms for heterogeneous systems and fuser optimizations.
Job Type:
Experienced Hire
Shift:
Shift 1 (India)
Primary Location:
India, Bangalore
Additional Locations:
Business group:
As a member of the Chief Technology Office, Artificial Intelligence, and Network and Edge Group (CTO AI NEX), you will be committed to strategically penetrating the AI market by delivering disruptive and transformative solutions. Your focus will be on leveraging technology innovation and incubation to drive commercial success, ensuring that advancements create significant value. The team is dedicated to profitably driving the software-defined transformation of the world's networks, setting new standards for efficiency and connectivity. Through these priorities, you will help lead the way in technological evolution and redefine the future of global networks.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust:
N/A
Work Model for this Role:
This role will require an on-site presence.
* Job posting details (such as work model, location, or time type) are subject to change.