McKesson

Staff SRE Engineer

Bangalore, KA, IND - 8th Floor of Onyx Building (I009) Full time

About McKesson Compile

Established in 1833, McKesson is a US Fortune 10 global leader in healthcare supply chain management solutions, retail pharmacy, healthcare technology, community oncology, and specialty care. We partner with life sciences companies, manufacturers, providers, pharmacies, governments, and other healthcare organizations to help provide the right medicines, medical products, and healthcare services to the right patients at the right time, safely and cost effectively.

Based in Bangalore India, McKesson Compile’s data is a comprehensive, full linked system of record for the US Healthcare market, with intelligence on 2M+ healthcare professionals (HCPs) and over 800K facilities. Compile’s data includes high capture medical and pharmacy claims, closed capture Medicare claims (100%), along with best-in-class provider affiliations and customer master.

At McKesson we deliver careers with purpose and potential. Our focus on better health starts with creating an inclusive environment with strong values where you can build a fulfilling career. You can count on us to provide you with resources and opportunities to grow and be your best, while contributing to our pursuit of improving lives.

About the Role

We are seeking a Site Reliability Engineer to join our team and help ensure the reliability, scalability, and performance of our systems. This role combines software engineering and systems administration to build and maintain resilient infrastructure and automation for our applications.

Key Responsibilities

  • Design, implement, and maintain monitoring solutions to ensure system health and performance.
  • Develop and manage CI/CD pipelines using GitHub Actions.
  • Deploy, manage, and troubleshoot containerized applications using Docker and Kubernetes.
  • Support and optimize Java-based applications in production environments.
  • Collaborate with development teams to improve system reliability and reduce operational toil.
  • Implement best practices for incident response, capacity planning, and disaster recovery.
  • Work with Azure cloud services for infrastructure provisioning and management.
  • Analyze and improve system observability using tools like Dynatrace (preferred).
  • Perform Linux and Windows system administration, including patching, configuration, and troubleshooting.
  • Automate operational tasks and workflows using tools such as Ansible, scripting (Python, Bash), or similar technologies to reduce manual effort and improve efficiency.

Required Skills & Qualifications

  • Strong experience with monitoring and observability tools (Dynatrace experience is a plus).
  • Hands-on experience with GitHub Actions for CI/CD automation.
  • Proficiency in Kubernetes and Docker for container orchestration.
  • Familiarity with Azure cloud services.
  • Experience with Ansible.
  • Demonstrated experience in automation of infrastructure and operational processes using scripting or configuration management tools.
  • Experience supporting Java applications in production.
  • Solid understanding of Linux and Windows system administration.
  • Knowledge of SRE principles (SLIs, SLOs, error budgets).