Real people. Real service.

At SupplyHouse.com, we value every individual team member and cultivate a community where people come first. Led by our core values of Generosity, Respect, Innovation, Teamwork, and GRIT, we’re dedicated to maintaining a supportive work environment that celebrates diversity and empowers everyone to reach their full potential. As an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004, we strive to foster growth while providing the best possible experience for our customers.

Through an Employer of Record (EOR), we are looking for a new Site Reliability Engineer in India to join our growing IT Team. This individual will report to our Director of IT and ensure the scalability, reliability, and performance of our infrastructure and applications, with a focus on automation, monitoring, and incident response. If you enjoy bridging software engineering with IT operations, we’d love to hear from you!

Role Type: Full-Time

Location: Remote from India

Schedule: Monday through Friday with a minimum schedule overlap of 4-5 hours per day with 8:00 a.m. to 5:00 p.m. U.S. Eastern Time to ensure effective collaboration

Base Salary: $29,000 – $36,000 USD per year

Responsibilities:

Communicate with a high level of proficiency in written and verbal English
Design, build, and maintain scalable, reliable systems on GCP (Compute Engine, GKE, Cloud Storage, Cloud SQL)
Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager
Build and maintain observability platforms (monitoring, logging, tracing) using tools such as Stackdriver (Cloud Monitoring), Prometheus, or Grafana
Manage incident response, conduct postmortems, and implement improvements to reduce recurrence
Partner with DevOps and engineering teams to enhance CI/CD pipelines for resilient deployments
Define and monitor SLAs, SLOs, and SLIs to ensure application availability and performance
Implement disaster recovery (DR) and backup strategies across cloud services
Continuously optimize performance, capacity, and cost-efficiency of GCP resources

Requirements:

Bachelor's degree in Computer Science, Engineering, or a related field
3+ years of hands-on experience as a Site Reliability Engineer, DevOps Engineer, Systems Engineer, or Cloud Infrastructure Engineer
Proven track record managing production-grade systems on Google Cloud Platform (GCP) or other cloud providers
Strong understanding of Linux/Unix system administration, networking, and troubleshooting
Experience implementing Infrastructure as Code (IaC) using tools like Terraform, Ansible, or Deployment Manager
Familiarity with containerization and orchestration technologies such as Docker and Kubernetes (GKE)
Experience with monitoring and observability tools (Google Cloud Operations Suite, Prometheus, Grafana, Datadog, ELK)
Experience defining and monitoring SLAs, SLOs, and SLIs to ensure application uptime and performance
Proven ability to handle incident response, conduct postmortems, and drive root cause analysis
Proficiency in at least one scripting language (Python, Bash, or Go) for automation and tooling
Hands-on experience building or managing CI/CD pipelines (Jenkins, GitLab CI, Cloud Build)
Strong background in configuration management and release automation