We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions.
With over 8,000 CI&Ters around the world, we’ve built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality.
We are looking for a Senior DevOps Engineer to ensure the reliability, scalability, and performance of cloud infrastructure by managing AWS environments, optimizing Kubernetes clusters, and implementing efficient CI/CD and automation practices, while collaborating closely with development teams and stakeholders.
Responsibilities
• Maintain and optimize AWS infrastructure and EKS clusters to ensure high availability, performance, and resilience.
• Monitor, troubleshoot, and resolve production incidents and platform-related issues in a timely manner.
• Design, implement, and continuously improve CI/CD pipelines to streamline deployment processes.
• Develop and maintain automation scripts using languages such as Python or Bash.
• Partner closely with development teams to support deployments, provide operational guidance, and promote DevOps best practices.
• Communicate infrastructure status, incidents, and improvement initiatives clearly to Directors and other stakeholders.
• Create and maintain comprehensive documentation for systems, processes, and troubleshooting procedures.
Implement and manage Infrastructure as Code (IaC) solutions to ensure consistent and scalable environments.
Requirements
• Proven experience in DevOps or SRE roles with strong exposure to cloud-native environments.
• Hands-on experience with AWS services such as EC2, EKS, IAM, S3, and Load Balancers.
• Strong knowledge of Kubernetes architecture, operations, and cluster management.
• Experience designing and maintaining CI/CD pipelines.
• Experience with Infrastructure as Code tools such as Terraform or CloudFormation.
• Experience managing Helm charts, including public and customized charts.
• Familiarity with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, CloudWatch, or New Relic.
• Strong troubleshooting and problem-solving skills in production environments.
• Excellent communication skills and ability to collaborate with cross-functional teams.
• Advanced or fluent English.
#LI-GP1