Udemy

Staff Site Reliability Engineer

Dublin, Ireland Full Time

Join Udemy. Help define the future of learning.

Udemy is an AI-powered skills acceleration platform built to help people and teams grow. It’s personalized, practical, and focused on real-world impact.

Our mission is simple: to transform lives through learning. Your work helps people around the world build skills they can use, whether they’re picking up something new or leveling up to stay ahead.

Over 80 million learners and 17,000 businesses already learn with Udemy. If you’re excited by change, energized by learning, and ready to have a real impact, you’ll feel right at home. 

Learn more about us on our company page.

Job posting date: Nov 7th 2025
Application deadline: Nov 21st 2025

 

About your skills 

  • Extensive knowledge of cloud technologies, with AWS experience being highly advantageous. 
  • Proven expertise in managing containerized workloads using Kubernetes in production environments. 
  • Proficiency in programming languages such as Python, Golang, or Kotlin. 
  • Strong familiarity with infrastructure-as-code (IaC) tools like Terraform and Helm. 

 

About this role 

As a Staff Site Reliability Engineer at Udemy, you’ll play a critical role in managing and evolving our infrastructure, from our CDN to our databases. You’ll oversee and improve tools like Helm and Terraform, build development environments that empower our engineering teams, and enhance reliability standards across the organization. Collaborating closely with dev teams, you’ll also design internal tools in Python and Golang while responding to incidents and driving best practices in reliability.

 

What you’ll be doing 

  • Leading projects to enhance and optimize our infrastructure and tooling in collaboration with the SRE team and engineering teams across Udemy. 
  • Acting as a mentor to other engineers on the SRE team, fostering growth and technical development. 
  • Championing SRE best practices throughout Udemy's engineering organization. 
  • Designing and implementing powerful, scalable tools to meet internal customer demands. 
  • Supporting and maintaining platforms like Kubernetes clusters and CI/CD pipelines. 
  • Contributing to incident management, identifying root causes, and driving continuous reliability improvements. 
  • Participating in the on-call rota to support mission-critical systems. 

 

What you’ll have 

  • Hands-on experience managing Kubernetes clusters and cloud environments at scale. 
  • Solid expertise in deploying infrastructure using infrastructure-as-code tools. 
  • Strong proficiency in writing tools and applications using languages such as Python, Golang, or Kotlin. 
  • Proven capability of being part of an on-call rotation and managing incidents effectively. 
  • A track record of working with diverse engineering teams, providing guidance on best practices for reliability and scalability. 
  • Excellent communication skills with a collaborative mindset, including the ability to both give and receive feedback constructively.

 

We understand that not everyone will match each of the above qualifications. However, we also realize that everyone has unique experiences that can add value to our company. Even if you think your background might not perfectly align, we'd love to hear from you!

 

#LI-SO1

Why work here?

You’ll grow here.
Learning is part of the job. You’ll get full access to Udemy courses, a monthly UDay to invest in yourself, and a budget to spend on whatever helps you improve. Many people are diving into AI lately, but what you focus on is up to you.

AI is real here.
We use it in the way we learn and the way we work. You’ll have the space and tools to experiment, apply, and get better at using AI in practical ways.

You’ll own your work.
We trust people to lead, make decisions, and follow through. You don’t need to wait for permission or layers of approval to have an impact.

You’ll build with others.
We collaborate openly and shape ideas together. Everyone has a voice, and good thinking is welcomed from any direction.

You’ll see your impact.
What you build helps people grow their skills, change their careers, or find a path forward. You’ve got the experience, why not use it to help others gain theirs?

Bring your curiosity. We’ll bring the platform and the support. Let’s LEARN together. 

Our Benefits Start with U

Our benefits start with you and were built to provide you and your family with the protection and care you need, making it easy to access the right coverage when you need it most. Benefits vary by region, and we encourage applicants to review our Australia Benefits, India Benefits, Ireland Benefits, Mexico BenefitsTurkiye Benefits & US Benefits, pages to get an understanding of some of the benefits we offer. For details on region-specific benefits, please refer to the information provided during the hiring process. 

Benefits outlined are provided as a general overview and may vary depending on the location, role, and employment classification. All benefits are subject to change at the discretion of the organization and in accordance with applicable laws and policies.

At Udemy, we value diversity and inclusion and consider qualified applicants without regard to race, color, religion, sex, national origin, ancestry, age, genetic information, sexual orientation, gender identity, marital or family status, veteran status, medical condition, or disability. We understand that not everyone will match each of the qualifications. However, we also realize that everyone has unique experiences that can add value to our company. Even if you think your background might not perfectly align, we'd love to hear from you! 

Information regarding data privacy is available within the Udemy Careers Privacy Notice.