About the role

Skills: Python, PyTorch, NLP, LLMs, Information Retrieval, Entity Resolution, Text Classification

We're building the gateway to the internet for AI agents. Our APIs already power hundreds of customers — and we went from 0 to $7M ARR in our first 12 months. Now we need someone who can push the boundaries of what our ML systems can do.

We're hiring a Founding ML Engineer to own the research and engineering behind our core intelligence layer. Our platform indexes hundreds of millions of professional profiles and company records from across the web. Making that data searchable, matchable, and enriched is an ML problem at its core.

This is not an MLOps role. You will be researching, training, and shipping models - from paper to prototype to production.

Who you are

3+ years building and shipping ML models in production — NLP, information retrieval, or entity resolution
Strong with transformer architectures — you've trained and fine-tuned encoder models, not just called APIs
You know how to build and evaluate retrieval systems, classifiers, and embedding models
Comfortable with contrastive learning, metric learning, and representation learning
Experience using LLMs for structured extraction, classification, or data generation at scale
Strong Python and PyTorch
A true grinder — we work very hard
Founder mentality — someone who wants to be a founder in the future OR was a founder earlier

What you'll be doing

You'll own the ML systems that turn messy, multilingual, web-scale data into structured intelligence. Some example problems:

A customer searches for "RevOps professionals" — you need to return people titled "Head of Revenue Department," "Revenue Operations Manager," and "VP Sales Operations," across English, French, and German
Three different data sources list what looks like three different companies — but it's actually one. You figure out how to resolve that automatically across millions of records
Given raw people data, infer the org chart — who reports to whom, what the team structure looks like, how the engineering org differs from sales
Detect what technologies a company uses from unstructured signals scattered across the web
Classify whether a job change was a promotion, lateral move, demotion, or just a title edit — and do it for millions of transitions
Map raw job titles to canonical titles, seniority levels, and job functions — across dozens of languages and naming conventions

Nice to haves

Experience with entity resolution or record linkage at scale
Built taxonomy or ontology systems over messy real-world data
Background in multilingual NLP or cross-lingual transfer
Scaled LLM inference pipelines in production
Published research or open-source contributions in NLP/IR
Experience with distributed training on GPU clusters

🚀 Y Combinator Company Info

🌐 Company Website

Y Combinator Batch: F24

Team Size: 20 employees

Industry: B2B Software and Services -> Sales

Company Description: Real-time B2B data via simple APIs

💰 Compensation

Salary Range: $150,000 - $275,000

Equity Range: 0.1% - 0.5%

📋 Job Details

Job Type: Full-time

Experience Level: 3+ years

Engineering Type: Machine learning

🛠️ Required Skills

Torch/PyTorch Python Natural Language Processing

🎯 Interview Process

## Interview Process

30 min video call with the CEO
45 min technical deep-dive (ML systems design + paper discussion)
Paid work trial on a real problem (~1-2 weeks)
Offer

🚀 Apply on Y Combinator Work at a Startup

Founding ML Engineer

About the role

Who you are

What you'll be doing

Nice to haves

🚀 Y Combinator Company Info

💰 Compensation

📋 Job Details

🛠️ Required Skills

🎯 Interview Process

Related Jobs

Lead Information Technology Specialist (Sysadmin) NF-04

Engineering Technician NF-04

Corporate AV Technician, Meetings & Events

SMT Process Engineer

Tool Room Sales Associate

Telematics Installer