<gh-intro>
<text>
We are looking for a Staff Machine Learning Engineer, AI Applications to join Bolt’s Customer Support Product team and lead the quality, learning, and knowledge foundations behind our AI automation stack. You will have the opportunity to shape how we evaluate and align LLM-based systems, improve knowledge quality and retrieval, and build the feedback loops that continuously improve customer support automation. If you are passionate about applied ML, retrieval and knowledge systems, and making AI products measurably better in production, this role will perfectly suit you.
</text>
</gh-intro>
<gh-about-us>
<title>About us</title>
<text>
With over 200 million customers in 50+ countries, Bolt is one of the fastest-growing tech companies in Europe and Africa. And it's all thanks to our people.
We believe in creating an inclusive environment where everyone is welcome, regardless of race, colour, religion, gender identity, sexual orientation, national origin, age, or disability.
Our ultimate goal is to make cities for people, not cars, and we need your help to achieve this mission!
</text>
</gh-about-us>
<gh-role-detail>
<title>About the role</title>
<text>
As a Staff Machine Learning Engineer, you will operate at the intersection of advanced AI, knowledge systems, and Bolt’s production support ecosystem, setting the technical direction for how we measure, improve, and scale AI quality across conversational experiences. You will help ensure our systems are accurate, helpful, grounded, and aligned with business and customer needs.
This is a high-impact, high-scope role where you will define the quality and learning strategy behind our AI systems, partner closely with engineering, product, operations, and domain experts, and turn real-world support interactions into durable improvements across models, retrieval, and workflows.
You will work closely with engineers and cross-functional partners building Bolt’s AI automation stack, helping ensure our systems are not only scalable and reliable, but also grounded, aligned, and continuously improving in production.
</text>
</gh-role-detail>
<gh-responsibilities>
<title>Main tasks and responsibilities:</title>
<bulletpoints>
- <point>Lead alignment of Customer Support AI system behaviour to Bolt’s business goals, customer experience standards, and policy requirements, improving answer quality, consistency, grounding, and escalation behaviour..</point>
- <point>Build and scale knowledge mining capabilities that turn conversations, support tickets, workflows, policies, and operational data into reusable knowledge assets and improvement signals.</point>
- <point>Own the evaluation strategy for LLM-powered customer support systems, including offline benchmarks, golden datasets, human review workflows, online experimentation, and production quality metrics.</point>
- <point>Develop feedback loops that connect production failures and human corrections back into prompts, retrieval, benchmarks, and model behaviour, enabling continuous improvement of automation quality.</point>
- <point>Partner closely with engineers working on AI application infrastructure, contributing ML expertise to areas such as tool-use quality, routing behaviour, memory quality, and multi-step task performance.</point>
- <point>Influence roadmap and technical direction across teams by bringing clarity to trade-offs in model behaviour, retrieval quality, experimentation, and business impact.</point>
</bulletpoints>
<text>We are not really looking for Node.js experts, but for cross-language engineers for whom a specific stack is just a tool to solve a problem, not the solution itself.</text>
</gh-responsibilities>
<gh-requirements>
<title>About you:</title>
<bulletpoints>
- <point>You have hands-on experience with retrieval-augmented systems, search, ranking, or knowledge-based AI applications, and understand what drives high-quality grounded responses.</point>
- <point>You have worked on knowledge-heavy or policy-constrained domains where correctness, consistency, and trust matter.</point>
- <point>You have experience building feedback loops from user behaviour, QA signals, annotations, or operational outcomes into system improvements.</point>
- <point>You are able to operate at both strategic and hands-on levels: defining long-term direction while also diving into data, experiments, and failure analysis.</point>
- <point>You have strong communication and stakeholder management skills, and can align engineering, product, and operations around quality strategy and trade-offs.</point>
- <point>Experience with LLM-based pipelines or agentic AI systems is a strong plus, especially in areas such as tool use, routing, memory, multi-step reasoning, or human-AI orchestration.</point>
</bulletpoints>
<text>
Experience is great, but what we really look for is drive, intelligence, and integrity. So even if you don’t tick every box, please consider applying if you feel you’re the kind of person described above!
</text>
</gh-requirements>
<gh-perks>
<title>Why you’ll love it here:</title>
<bulletpoints>
- <point>Play a direct role in shaping the future of mobility.</point>
- <point>Impact millions of customers and partners in 600+ cities across 50+ countries.</point>
- <point>Work in fast-moving autonomous teams with some of the smartest people in the world.</point>
- <point>Accelerate your professional growth with unique career opportunities.</point>
- <point>Get a rewarding salary and stock option package that lets you focus on doing your best work.</point>
- <point>Enjoy the flexibility of working in a hybrid mode with a minimum of 2 days in the office each week to foster strong connections and teamwork.</point>
- <point>Take care of your physical and mental health with our wellness perks.</point>
</bulletpoints>
<text>*Some perks may differ depending on your location and role.</text>
</gh-perks>