Our Purpose
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Director, Site Reliability Engineering
Our Purpose:
Mastercard powers economies and empowers people across more than 200 countries and territories worldwide.
We are committed to building an inclusive, digital economy that benefits everyone, everywhere—by making transactions safe, simple, smart, and accessible. Through secure data, trusted networks, strong partnerships, and relentless innovation, we help individuals, financial institutions, governments, and businesses unlock their greatest potential.
About the Role:
Mastercard’s Program-aligned Site Reliability Engineering (SRE) teams are dedicated to delivering a seamless experience for our customers. We achieve this by maintaining every aspect of our Programs’ infrastructure and technology ecosystem to the highest standards and ensuring compliance with rigorous security requirements.
Within Mastercard, SRE focuses on the reliability and performance of core infrastructure, networks, and foundational services that power our applications. Our mission is to ensure these components operate with excellence, enabling applications to deliver an outstanding customer experience.
In this role, you will join our Payments Network SRE team, where you will manage a team of highly skilled SRE infrastructure engineers with diverse expertise. Your team’s mission will be to continuously support, assess, and enhance the service quality of our business application infrastructure and environments.
Key Responsibilities:
• Lead the vision, strategy, and execution of the Infrastructure SRE organization supporting mission-critical Payment Networks applications, ensuring alignment with business and platform roadmaps.
• Provide strong technical leadership by driving high-level architectural discussions, influencing cross-functional engineering teams, and shaping scalable, secure, and highly available infrastructure solutions.
• Mentor, develop, and support engineers across skill levels, overseeing team meetings, one-on-ones, performance management, and long-term career development plans.
• Establish, track, and report on key team OKRs and KPIs that support broader business objectives, infrastructure health, and operational maturity.
• Foster a culture of innovation, collaboration, and continuous improvement across engineering and operational teams.
• Drive governance, enterprise standards, compliance requirements, and operational excellence to increase platform scalability, uptime, availability, and resiliency.
• Advance observability and telemetry capabilities to enable proactive monitoring, intelligent alerting, automated remediation, and improved root cause analysis (RCA).
• Champion reliability engineering best practices—including chaos engineering, capacity planning, incident management, and service readiness processes—to reduce operational risk and service disruption.
• Partner closely with Product, Architecture, Security, and Development teams to ensure infrastructure design, operational frameworks, and runtime practices support both current and future business needs.
• Own and optimize incident response frameworks, post-incident reviews, and reliability KPIs to continuously reduce incident frequency, impact, and mean time to recovery (MTTR).
• Oversee budget planning, resource allocation, and vendor/technology evaluations to ensure cost-effective and scalable infrastructure investment decisions.
All about you:
• 5–10 years’ experience as a technology leader in Site Reliability Engineering, Infrastructure Operations, or delivering large‑scale infrastructure solutions.
• Strong people and performance management skills, with a demonstrated ability to coach, mentor, mature, and motivate high‑performing technical teams.
• Proven experience driving a culture of accountability, continuous improvement, and operational excellence.
• Deep knowledge of core infrastructure technologies, including database, compute, storage, networking, cloud platforms, virtualization, and containerization.
• Strong understanding of infrastructure architecture principles, including lifecycle management, governance, and operational readiness.
• Ability to lead teams through complex technical problems, with a proven track record in root cause analysis (RCA) across multi‑disciplinary engineering groups.
• Strong working knowledge of ITIL best practices, including Change, Incident, Problem, and Service Management.
• Demonstrated experience improving operational processes, reducing incident noise, and enhancing system reliability and availability.
• Skilled in making data‑driven operational decisions using SLIs/SLOs, KPIs, and service health metrics.
• Knowledge of SRE principles, including automation, observability, monitoring, capacity management, and resilience engineering.
• Experience implementing infrastructure-as-code, automation frameworks, and continuous improvement initiatives that reduce toil and enhance stability.
• Excellent communication skills with the ability to translate complex technical issues into clear, actionable information for senior leaders and non‑technical stakeholders.
• Strong collaboration mindset with a history of partnering effectively across Product, Engineering, Architecture, and Security teams.
• Demonstrated success leading teams through large‑scale change initiatives, platform migrations, cloud adoption, or major service transformations.
The Payments Network SRE team is responsible for the runtime availability of some of Mastercard’s most critical core payment systems, which support national infrastructure and operate 24/7 year‑round. As a result, this role will include periodic on‑call responsibilities when required.
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
• Abide by Mastercard’s security policies and practices;
• Ensure the confidentiality and integrity of the information being accessed;
• Report any suspected information security violation or breach; and
• Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.