About Overland AI
Location: Seattle, WA (Hybrid — 3 days onsite)
Travel: Occasional in-state travel; 1–2 weeks out-of-state per year
Founded in 2022 and headquartered in Seattle, Washington, Overland AI is transforming land operations for modern defense. The company leverages over a decade of advanced research in robotics and machine learning, as well as a field-test forward ethos, to deliver combined capabilities for unit commanders. Our OverDrive autonomy stack enables ground vehicles to navigate and operate off-road in any terrain without GPS or direct operator control. Our intuitive OverWatch C2 interface provides commanders with precise coordination capabilities essential for mission success.
Overland AI has secured funding from prominent defense tech investors including 8VC and Point 72, and built trusted partnerships with DARPA, the U.S. Army, Marine Corps, and Special Operations Command. Backed by eight-figure contracts across the Department of Defense, we are strengthening national security by iterating closely with end users engaged in tactical operations.
Role Summary
Overland AI is looking for an experienced Infrastructure Engineer to help design, build, and operate the systems that power our AI model training, experiment management, and robotic deployments. This role spans on-premise environments, cloud infrastructure, networking, and automation. You’ll work hands-on with servers, storage, firewalls, wireless equipment, and high-performance compute resources—while also developing scalable tooling that improves reliability, observability, and developer velocity.
The ideal candidate has 5+ years of experience in infrastructure engineering, DevOps, SRE, or systems engineering, with deep knowledge of on-prem environments, AWS deployments at scale, and modern infrastructure-as-code and automation practices.
What You'll Do
Build, operate, and evolve on-premise and cloud infrastructure supporting AI/ML development and robotics programs
Deploy and manage AWS environments including IAM, EC2, VPCs, and S3
Install, configure, and troubleshoot physical servers, networking equipment, and storage systems
Implement and maintain infrastructure-as-code (Terraform, Ansible, Puppet, Chef, etc.)
Support on-prem Kubernetes clusters (clusteradm, Kops) and GitOps workflows (ArgoCD, Flux, Spinnaker)
Develop CI/CD pipelines using GitLab or GitHub Actions
Build custom automation and internal infrastructure tooling
Manage observability stacks (Prometheus/Grafana, ELK, Datadog, etc.)
Partner closely with engineering teams to ensure reliability, security, and efficient scaling
Document systems, processes, and runbooks to support local and remote teams
Required Qualifications
5+ years in infrastructure engineering, DevOps, SRE, or systems engineering
Experience with AWS orchestration and deployments at scale
Experience with on-prem hardware environments (VMWare, Proxmox, or equivalent)
Hands-on experience building and troubleshooting physical servers and networks
Strong Linux administration skills
Deep understanding of networking: firewalls, L3 switches, routing, VPNs, WAN/wireless systems
Proficiency with infrastructure-as-code tooling (Terraform, Ansible, Puppet, Chef, etc.)
Experience with Kubernetes and GitOps systems
CI/CD experience with GitLab, GitHub Actions, or similar platforms
Ability to program in Python, Go, Rust, or a similar language (in addition to shell)
Experience with observability and monitoring stacks
Excellent documentation, communication, and collaboration skills
Nice to Have
Familiarity with experiment tracking, ML infrastructure, or data visualization tooling
Experience integrating hardware or embedded systems
Experience deploying or supporting wireless/WAN infrastructure in field, test, or event environments
Familiarity with ML/AI infrastructure, high-performance compute clusters, or robotics-focused environments
Other Requirements
Ability to travel in-state, including occasional long days during deployments or testing
Ability to travel out-of-state for ~1–2 weeks per year
Ability to work onsite in our Seattle office at least 3 days per week
Benefits