Cozmo AI

Chief of Staff - Product Manager

San Francisco, US Full-time

About Cozmo AI

We build AI Employees that speak, think, and act across revenue-critical workflows for regulated enterprises. Real calls, real outcomes, real accountability. Our Agent Protocol Engine logs every step, tool call, and reason code, so customers can trust what the agent did and why. Backed by Y Combinator, Goodwater Capital, and Dubai Future District Fund, we’ve reached $1.5 M ARR in < 6 months, and are scaling from $1.5 M → $10 M ARR in 12 months.

The Role

You will be Nuha’s right hand to turn strategy into shipped product and top-tier agents. You will run the operating system of product at Cozmo AI. You will align engineering, forward-deployed teams, and GTM to move from concept to live agents with measurable business impact.

What you will own

  • Product operating cadence. Build and run weekly product rituals. Define PRDs, clarify requirements, unblock teams, and keep critical paths green.
  • Agent quality and outcomes. Set KPIs for first call success, compliance pass rate, latency, and revenue lift. Drive improvement across STT, LLM reasoning, TTS, tool use, and memory.
  • Roadmap and prioritization. Translate company goals into a 90-day roadmap. Rank work by business impact and technical risk.
  • Integration and last-mile delivery. Partner with forward-deployed engineers to land agents inside real customer systems. Own pre-go-live checklists, data mappings, and acceptance criteria.
  • Program management for speed. Keep multiple workstreams moving across voice pipeline, APE policies, domain adapters, and CRM integrations.
  • Quality systems. Stand up experiment design, eval harnesses, red-team tests, incident review, and release gates.
  • GTM alignment. Turn product capabilities into clear enablement for sales and customer success. Own release notes and internal docs.
  • Hiring bar and culture. Help recruit A-players in product, data, and forward-deployed engineering. Raise the bar.

What great looks like in 6 months

  • P0 features ship on time with visible lifts in agent KPIs.
  • APE policies and evals are standardized per vertical.
  • Two new high-value workflows go live with referenceable customers.
  • Roadmap, metrics, and postmortems are transparent and trusted.
  • Engineering, FDT, and GTM operate on a single source of truth.

Your background

  • Education. Bachelor’s or Master’s in Computer Science, Electrical Engineering, or related field from an Ivy League university.
  • Bonus if you have a MBA. Recent MBA from an Ivy League school, preferred. You use it to reason about impact, unit economics, and pricing.
  • Engineering foundation. 4 to 8 years in software or data. Comfortable reading code, APIs, and infrastructure diagrams.
  • Acute product sense. You can write a crisp PRD in an hour, cut scope without cutting value, and design for real world edge cases.
  • AI or platform experience. Exposure to LLMs, speech pipelines, or agent frameworks is a plus. Curiosity is required.
  • Tools. You can query with SQL, sketch quick analyses in Python, and run a clean board in Jira or Linear.
  • Communication. Clear writing, structured thinking, low ego, high urgency.

Skills we will test for

  • Roadmapping under constraints
  • Agent evaluation design and A/B thinking
  • Technical depth across APIs, data flows, and failure modes
  • Narrative clarity with customers and execs

Compensation

Competitive salary, meaningful equity, health coverage, and relocation support if needed.

🚀 Y Combinator Company Info

Y Combinator Batch: W22
Team Size: 11 employees
Industry: B2B Software and Services
Company Description: We help you build AI Employee who can speak, think and do

💰 Compensation

Salary Range: $120,000 - $239,995
Equity Range: 0.5% - 2.98%

📋 Job Details

Job Type: Full-time
Experience Level: 1+ years

🎯 Interview Process

## Interview process
  1. Intro with the founder. Role, scope, your goals.
  2. Product case. Write a one page PRD for a regulated collections agent.
  3. Technical deep dive. Walk through an integration plan for Salesforce or Oracle and the APE policy needed to ship it.
  4. Working session. Live triage of a latency incident and a compliance failure.
  5. Founder loop. Decision, calibration, and offer.
  6. References. Managers, peers, and cross-functional partners.