Senior Data Scientist
Who we are looking for
We are seeking a highly curious Senior Data Scientist with 8+ years of experience to join our AI Center of Excellence (CoE), focused on developing and scaling enterprise-grade Generative AI (GenAI) solutions. The ideal candidate will have hands-on experience building NLP and LLM-based applications, a deep understanding of modern machine learning techniques, and a passion for applying them to real-world financial services problems.
What you will be responsible for
As Senior Data Scientist – AI CoE (Generative AI), you will:
- Lead development of and experimentation with GenAI capabilities across key initiatives such as RAG-as-a-Service, Dynamic SQL as Service, Fine-Tuning Studio, and Agent Marketplace, as well as upcoming projects.
- Design and evaluate solutions for tasks such as summarization, question-answering, classification, text-to-SQL, information extraction, and semantic search.
- Engineer high-quality prompts, retrieval pipelines, and evaluation datasets for enterprise use cases.
- Fine-tune LLMs (e.g., Mistral, LLaMA, Qwen, Gemma, Phi, Falcon) using LoRA/QLoRA, DPO, RLHF, and other continual learning techniques.
- Analyze and mitigate model risks such as hallucination, prompt injection, bias, and toxicity using standard RAI frameworks.
- Collaborate with engineering teams to productionize models and embed them in microservices and pipelines.
- Engage with business stakeholders to understand requirements and demonstrate model effectiveness through rapid prototyping and storytelling.
- Contribute to internal knowledge sharing via workshops, research papers, and training sessions.
What we value
These skills will help you succeed in this role:
- Strong grasp of modern machine learning and NLP methods including transformers, retrieval-augmented generation (RAG), embedding models, and attention mechanisms.
- Hands-on experience with Hugging Face, LangChain/LangGraph, PyTorch/TensorFlow, Weights & Biases, OpenAI/Anthropic APIs, and vector stores (e.g., Pinecone, FAISS, Chroma, CosmosDB).
- Solid software engineering practices: version control (Git), CI/CD, testing, and working in containerized/cloud environments (Azure, AWS).
- Familiarity with observability and evaluation frameworks (e.g., LangSmith, TruLens, ReLM, Elo rating, hallucination metrics).
- Excellent problem-solving and communication skills, with the ability to explain technical results to non-technical stakeholders.
- Prior experience in financial services is a strong plus.
Education & Preferred Qualifications
- M.S. or B.Tech. in Computer Science, Data Science, Applied Mathematics, or a related field.
- 8+ years of experience building and deploying ML/AI solutions in production environments.
- Publications, patents, or open-source contributions in GenAI or NLP are a plus.
About State Street
Across the globe, institutional investors rely on us to help them manage risk, respond to challenges, and drive performance and profitability. We keep our clients at the heart of everything we do, and smart, engaged employees are essential to our continued success.
We are committed to fostering an environment where every employee feels valued and empowered to reach their full potential. As an essential partner in our shared success, you’ll benefit from inclusive development opportunities, flexible work-life support, paid volunteer days, and vibrant employee networks that keep you connected to what matters most. Join us in shaping the future.
As an Equal Opportunity Employer, we consider all qualified applicants for all positions without regard to race, creed, color, religion, national origin, ancestry, ethnicity, age, disability, genetic information, sex, sexual orientation, gender identity or expression, citizenship, marital status, domestic partnership or civil union status, familial status, military and veteran status, and other characteristics protected by applicable law.
Discover more information on jobs at StateStreet.com/careers