Amgen

Principal Data Scientist

India - Hyderabad Full time

Career Category

Information Systems

Job Description

As a Principal Data Scientist, you will build and model digital product and platforms that bring Amgen’s AI/ML and GenAI solutions to life.


This role has to collaborate with Amgen’s Technical Architect, Product Manager, UX designers, and Back-end engineers to design secure, scalable, and user-centric products that accelerate discovery, manufacturing, and commercial analytics, corporate functions products


Key Responsibilities

  • Strong experience with statistics and machine learning, including deep learning, natural language processing (NLP) and, experience in building cloud-scale systems and working with open-source stacks for data
  • Experiment with large language models (LLM), Artificial intelligence (AI) for code or related fields, Generative AI, Foundational Models, Supervised and Unsupervised Learning
  • Collaborate with cross-functional teams to understand the requirement and design solutions that meet business needs. Also with Data Architects, Business SMEs, and Technical Architect to design and develop  to meet fast-paced business needs.
  • Explore new tools and technologies that will help rapid development of solutions
  • Participate in sprint planning meetings and provide estimations on technical implementation 
  • Embed responsible-AI and security-by-design controls


Required Qualifications

  • 6 –12 years of experience in Data Science  (eg: managing structured and unstructured data, applying statistical techniques and reporting results)
  • Doctorate in Computer Science, Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science or a related field 
  • Leverage cloud platforms (AWS preferred) to build scalable and efficient solutions 
  • Strong background in Deep Learning, Machine Learning, NLP, Data Mining.
  • Excellent communication and stakeholder management skills.


Preferred Skills

  • Experience in Generative AI, Foundational Models, LLM's, Feature Engineering, Selection & Extraction, BI & Automation, Predictive Modelling, Data Visualization, CNN, RNN, GNN, Transformers, Exploratory Data Analysis
  • Familiarity with Python packages (Pytorch, TensorFlow, Hugging FaceScikit-learn, Pandas, NumPy, Matplotlib, Cloud Vision API, RAG, TensorBoard, OpenCV, NLTK)), programming languages ( C, C++, Java, CUDA,SQL, NoSQL, PHP, HTML, JS, CSS etc.)
  • Prior exposure to pharma / life sciences AI environments is preferred.
  • Strong understanding of Responsible AI and model validation principles.

.