LLM usage has seen a meteoric rise in the past year, but there is still a significant gap between agentic innovation and its use in the real world. This is especially true for underserved industries like automotive and healthcare, where outdated systems persist due to barriers to entry, legacy software, and high-stakes consequences of hallucinations and failure.
Here at Toma (YC W24), we are bridging this gap by providing a customer-centric platform to deploy and monitor AI agents, even for non-technical users. We recently raised a $17M Series A from a16z and are building the future of human-AI interactions, starting in the automotive industry.
We’re assembling a team of Avengers: engineers, product managers, former founders, athletes, and leaders from Scale AI, Uber, Braze, Microsoft, Amazon, and more. We consider everyone, regardless of background or identity.
We’re searching for a Senior Software Engineer to own and set technical direction for our core platform. You'll mentor engineers, collaborate closely with product and design, and help create fast, reliable, and magical user experiences. This is a deeply hands-on role—expect to write production code, review contributions, shape our architecture, and scale our platform as we grow.
Owning our entire infrastructure (monorepo with 10+ microservices on AWS and Porter)
Building and maintaining our ML/LLM model deployment pipeline and serving infrastructure
Owning our incident response process, including on-call rotations and alerting systems
Designing and implementing observability solutions across our stack
Partnering with other engineers to improve latency and performance in our realtime systems
Communicating with external vendors regarding cloud offerings and APIs
Maintaining our compliance programs (SOC 2 + GDPR + ISO 27001 in progress)
Managing cloud costs and optimizing resource utilization
3+ years of experience in platform/infrastructure engineering
Strong background in system design, operating systems, and distributed systems
Deep expertise with AWS services (ECS/EKS, IAM, VPC, S3, RDS)
Experience with containerization and orchestration (Docker, Kubernetes)
Track record of building and maintaining production ML/LLM infrastructure
Experience with observability tools (Prometheus, Grafana, ELK stack)
Solid understanding of TypeScript
Don’t think you meet all the qualifications? Apply anyway. We’d love to hear what excites you about us, and we may have a role that's a good fit for you.
MacBook Pro 16" M4 Max (or newest high-end equivalent)
Free daily in-office lunch and dinner
Competitive salary with meaningful equity
Free health, dental, and vision insurance
Weekly team outings and customer visits
Unlimited PTO