Job Details:
Job Description:
The Applied Machine Learning group is responsible for innovation and development of end-to-end AI solutions, technology proof of concepts, and IP development of current and future ML workloads for Intel architecture and silicon serving consumer and corporate business requirements.
In this position, you will be responsible for research, modeling, prototyping, productizing of ML techniques, generating data insights and optimizations for Intel platforms.
Responsibilities include but are not limited to:
- Builds machine learning based products/solutions, which provide descriptive, diagnostic, predictive, or prescriptive models based on data.
- Uses or develops machine learning algorithms, such as supervised and unsupervised learning, deep learning, reinforcement learning, generative AI, large language model and others, to solve applied problems in various disciplines such as Data Analytics, Computer Vision, Natural Language Processing, Recommendation System, Graph Neural Network, Robotics, etc.
- Interacts with users to define requirements for breakthrough product/solutions.
- In either research environments or specific product environments, utilizes current programming methodologies to translate machine learning models and data processing methods into software.
- Completes programming, testing, debugging, documentation and/or deployment of the solution/products.
- Engineers big data computing frameworks, data modeling and other relevant software tools.
- You will play a key technical role for end-2-end machine learning and deep learning platform development based on various frameworks and hardware (such as CPU, GPU, accelerators).
- You will also be responsible for developing AI ML solutions and methodologies to bring the best performance, accuracy, efficiency, and ease-of-use to customers by working with internal and external partners.
The job scope may include but not limited to:
- End-2-end ML and DL platform component innovation and feature development in data ingestion, feature engineering, distributed training via data and model parallelization, hyper-parameter optimization, neural architecture search, model compression, quantization, distillation, and model serving.
- Algorithm and model development of advanced technologies in computer vision, natural language processing, large language model, recommendation, graph analytics, reinforcement learning, and other domains.
- Machine learning framework and workload performance profiling, optimization, insights generation for benchmark such as MLPerf as well as real-world customer use cases.
- Software and tools development in python, C++, and other languages as required.
Behavioral traits that we are looking for:
- Excellent communication skills.
- Willing to clearly communicate technical details and concept.
Qualifications:
Minimum qualifications, you must possess the below minimum qualifications to be initially considered for this position:
- Master´s degree with 4+ years or Ph.D. with 2+ years of experience in: Computer Engineering, Computer Science, Data Science, Software Engineering, Electronic Engineering, Physics, Mathematics, Aerospace engineering, applied mathematics, mechanical engineering, or related STEM disciplines
4+years of the following technical skills:
- Experience in deep learning frameworks such as PyTorch, TensorFlow, working on CPU / GPU / AI accelerators for ML/DL.
- Experience in GenAI frameworks such as transformers, PEFT, diffusers, TGI, TEI, vLLM or SGLang for both training finetuning and inference serving.
- Working with Performance optimization / tuning.
- Experience in computer vision, recommendation, natural language processing, or reinforcement learning.
Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates:
- Proven track record of a leadership role in machine learning, deep learning research and applications demonstrated by patents, publications, product delivery, or other means.
- Experience on performance optimization for PyTorch framework, MLPerf benchmark and other SOTA workload.
- Distributed training, DeepSpeed, Torch DDP and FSDP, Ray SGD.
- Inference optimization such as quantization, sparsity, distillation.GenAI end-2-end workflow development and deployment for model serving, RAG and agent system.
Job Type:
Experienced Hire
Shift:
Shift 1 (United States of America)
Primary Location:
US, California, Santa Clara
Additional Locations:
Business group:
The Software and AI (SAI) Team drives customer value by enabling differentiated experiences through leadership AI technologies and foundational software stacks, products, and services. The group is responsible for developing the holistic strategy for client and data center software in collaboration with OSVs, ISVs, developers, partners and OEMs. The group delivers specialized NPU IP to enable the AI PC and GPU IP to support all of Intel's market segments. The group also has HW and SW engineering experts responsible for delivering IP, SOCs, runtimes, and platforms to support the CPU and GPU/accelerator roadmap, inclusive of integrated and discrete graphics.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Benefits:
We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, bonuses, as well as, benefit programs which include health, retirement, and vacation. Find more information about all of our Amazing Benefits here:
https://intel.wd1.myworkdayjobs.com/External/page/1025c144664a100150b4b1665c750003
Annual Salary Range for jobs which could be performed in the US:
$188,700.00-266,400.00 USD
The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific compensation range for your preferred location during the hiring process.
Work Model for this Role
This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.