Lilly

Linux Server Platform Operations Engineer

India, Hyderabad Full time

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.

Come help us unlock the power of Infrastructure Operations through AI & Automation

The Cloud and Connectivity organization is actively looking for a Linux Server Platform Operations Engineer. Do you like to solve challenges and have an interest in large-scale impact? Would you like to shape global public and private cloud infrastructure operations through AI and automation? If so, please apply.

Job Summary:

We are seeking a highly skilled and experienced Linux Server Platform Operations Engineer to oversee the operations, management, and support of our enterprise Linux Server environment. The ideal candidate will be responsible for ensuring the stability, reliability, and performance of the Linux server infrastructure. This position requires in-depth technical expertise, leadership skills, and a proactive approach to problem-solving and operational excellence.

This role offers the opportunity to work with the latest public and private cloud IaaS services. Our goal is to improve all aspects of infrastructure availability and reliability through repeatable patterns, new architectural designs, and better observability that prevents outages, increasing value across the organization. The role will also provide guidance and direction to our global Lilly operations SMEs and connect with other platform infrastructure operations SMEs to deliver the daily operations associated with this area.

How You’ll Succeed

  • Be Bold - You will drive Infrastructure Operations to never have to fix the same problem twice through adoption of AI OPS, Event Driven Automation, and robust Observability.
  • Be Fast - You will accelerate initiatives in areas such as: Infrastructure AI OPS automation, cloud IaaS management, and cloud infrastructure as code to enable critical business projects.
  • Be Proactive - You will have groundbreaking chances to transform our operations processes using proactive, predictive, and automated AI & Observability capabilities.
  • Be Your Best - You will bring high learning agility and infrastructure operations and engineering skills to help us enable the Lilly Technology strategy, identify tech opportunities, and accelerate our AI OPS journey.

Key Responsibilities:

Linux Server and Cluster Management:

  • Advanced expertise in Red Hat Enterprise Linux (RHEL), Ubuntu, Amazon Linux, and SUSE Linux Enterprise Server (SLES).
  • Experience with KVM on RHEL and Red Hat OpenShift, with a good understanding of container technologies such as Docker and Kubernetes.
  • Experience managing General Parallel File System (GPFS) clusters and Pacemaker clusters for high availability.
  • Strong Linux network management and troubleshooting skills, including TCP/IP, DNS, DHCP, firewall configuration, and knowledge of VLANs.
  • VMware vSphere management, including VMware-hosted Linux servers and physical servers (HP/Dell/IBM).
  • Storage management for Linux servers, including LVM, XFS, NFS, and NAS.

Automation and Scripting:

  • Proficiency in writing Ansible playbooks for automation of system configurations and deployments.
  • Extensive experience with the Ansible Automation Platform, including Ansible Tower and AWX for centralized automation management.
  • Strong skills in Bash and Python scripting for system management and automation tasks, including cron jobs.

Disaster Recovery (DR) and Zerto Tool:

  • Experience designing and implementing Disaster Recovery (DR) strategies, including backup and restore procedures.
  • Hands-on experience with Zerto for disaster recovery and business continuity, including Zerto Virtual Manager (ZVM) and Zerto Cloud Appliance (ZCA).
  • Implement and manage General Parallel File System (GPFS) and Pacemaker clusters.

CI/CD and Cloud:

  • Experience building and managing Continuous Integration/Continuous Deployment (CI/CD) pipelines using GitHub Actions, Jenkins, or similar tools.
  • Advanced knowledge of Amazon Web Services (AWS) and Microsoft Azure cloud infrastructure and services, including EC2, S3, VPC, Azure Virtual Machines, and Azure Blob Storage.
  • Manage Red Hat Satellite for patching and system lifecycle management.

Identity and Access Management:

  • Expertise in Centrify and Lightweight Directory Access Protocol (LDAP) integration for authentication and authorization.
  • Strong understanding of Red Hat Satellite for patching, system management, and content lifecycle management.

SOX Security Audit:

  • Experience in SOX (Sarbanes-Oxley) security audit steps for Linux servers, including access control, change management, data backup, and security monitoring.
  • Proficiency in implementing and maintaining SOX compliance for Linux environments, ensuring adherence to regulatory requirements.

24x7 Availability and Agile:

  • Availability for 24x7 support for mission-critical systems, including on-call rotations and incident management.
  • Experience working in Agile environments, including sprint planning, daily stand-ups, and retrospectives.

Documentation and Training:

  • Proficiency in creating technical documentation and Standard Operating Procedures (SOPs).
  • Ability to mentor and train junior team members, including knowledge transfer sessions and technical workshops.

Project & Team Management:

  • Provide expertise and leadership to turn ideas and concepts into effective solutions. Mentor team members and share knowledge to elevate team performance.
  • Lead infrastructure projects such as server migrations, upgrades, and deployments, ensuring timelines and goals are met.
  • Collaborate with cross-functional teams to plan and implement new technologies or enhancements.
  • Develop and maintain documentation, including procedures, system configurations, and disaster recovery plans.

Security & Compliance:

  • Ensure adherence to organizational security policies and regulatory compliance requirements.
  • Maintain and troubleshoot OS-related configurations such as security updates, antivirus solutions, and vulnerability remediation on Linux server systems.
  • Assist in periodic SOX and internal audits of server environments and configurations to identify and mitigate risks.

Stakeholder Collaboration:

  • Work closely with business units, application teams, and other IT departments to address requirements, dependencies, and operational needs.
  • Communicate effectively with stakeholders, providing updates on operational performance, projects, and incidents.

Incident & Change Management:

  • Manage incident resolution and root cause analysis for critical server issues.
  • Coordinate daily operational tasks, incident management, and problem resolution with the team.
  • Oversee change management processes, ensuring minimal impact to production environments.

Required Skills & Qualifications:

Technical Expertise

  • 5-15+ years of experience managing enterprise-scale Linux Server environments.
  • Strong expertise in Red Hat Enterprise Linux (RHEL), Ubuntu, OpenShift, Kubernetes, Amazon Linux, and SUSE Linux Enterprise Server (SLES).
  • In-depth knowledge of Red Hat Satellite patch management, automation (PowerShell scripting), configuration management with Ansible, and backup solutions.
  • Proficiency in disaster recovery strategies and high-availability configurations (e.g., clustering).
  • Familiarity with cloud technologies such as Azure or AWS (especially hybrid cloud environments).
  • Experience with AWS, Azure, CI/CD pipelines, and DevOps methodologies.
  • Knowledge of security best practices, compliance (SOX), and disaster recovery.
  • Strong troubleshooting and problem-solving skills.

Soft Skills

  • Strong analytical and troubleshooting skills, with the ability to handle complex technical challenges.
  • Proven leadership and team management experience, with excellent interpersonal and communication skills.
  • Ability to prioritize, multitask, and work effectively under pressure in a fast-paced environment.
  • Strong problem-solving and leadership abilities.
  • Effective communication and collaboration skills.

Education & Certifications

  • Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience).
  • Certifications: Red Hat Certified Engineer (RHCE), AWS Solutions Architect, or Azure Administrator.
  • Kubernetes/OpenShift certifications preferred.

Desirable Skills:

  • Experience in automating administrative tasks using tools such as Ansible, Terraform, or other DevOps tools.
  • Knowledge of ITSM tools (e.g., ServiceNow) and experience in ITIL-based processes.
  • Experience working in a regulated environment (e.g., healthcare, finance, pharmaceuticals).

Additional Information:

  • Role located in Hyderabad (relocation required).
  • Flexible working hours may be required. This team supports continuous operations across two shifts, so this role requires non-standard work hours and some work on weekends and holidays. Appropriate adjustments in benefits will be provided for employees working non-standard hours where applicable.

Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (https://careers.lilly.com/us/en/workplace-accommodation) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.

Lilly does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status.

#WeAreLilly