Cboe

Engineer - Linux

Singapore Full time

Job Description

Building trusted markets - powered by our people

At Cboe Global Markets, we inspire our people to solve complex challenges together because what we do matters. We provide the financial infrastructure that powers the global economy. As a leading provider of market infrastructure and tradable products, Cboe delivers cutting-edge trading, clearing and investment solutions to market participants around the world.

We are building meaningful ways to support professional and personal development while strengthening the trust we have earned as a global market leader. Our teams are empowered to share ideas, actively pursue them and bring on a challenge. As champions of internal mobility and access to opportunity, we encourage our people to go for it and equip our managers with the training to coach their teams to the next level.

Sound like the place for you? Join us!

Role Overview

The Linux Platform Operations Engineer is a hands-on infrastructure specialist who designs, builds, and maintains the server, storage, operating system, and configuration layers that support all trading environments. This role sits within a small, highly skilled team of engineers focused on reliability, performance, and automation. Members of this team work hands-on with modern hardware, develop in-house automation and monitoring solutions, leverage open-source technologies, and make meaningful contributions to systems that are critical to the business.

The Linux Platform Operations Engineer must be able to work independently with limited direct supervision, demonstrating ownership of assigned infrastructure domains including bare-metal systems, OS lifecycle, configuration management, and tooling. This role collaborates closely with other Infrastructure, Software Engineering, SRE, and Security teams globally, and serves as a technical escalation point for Linux-related incidents and capacity questions.

Major Job Duties

1. Systems Build & Configuration Management: Design, build, and maintain server and storage infrastructure supporting all global trading environments. Provide configuration management of new and existing Linux platforms using Salt. Build and manage RPM packages and deploy system updates globally. Monitor engineering activities and change management tickets, evaluating their impact on production operations. Execute change tickets in support of updates to production, disaster recovery, and certification systems. While the primary focus is bare-metal on-premises infrastructure, experience with cloud platforms (AWS, GCP) and containerization (Docker, Kubernetes) is desirable as Cboe evolves its infrastructure strategy.

2. Incident Response & Technical Troubleshooting: Serve as a primary technical responder for P1/P2 Linux infrastructure incidents. Lead or participate in incident triage, root cause analysis, and resolution in coordination with globally distributed engineering and operations teams. Author post-incident reviews and drive remediation tracking to improve long-term platform stability. Provide clear, timely communication to stakeholders during active incidents.

3. Performance Tuning & System Availability: Work closely with Linux Platform Engineering, development, and SRE teams on capacity scaling and performance tuning. Apply a deep understanding of the Linux kernel (scheduling, networking, I/O, monitoring). Operate and maintain low-latency bare-metal infrastructure, including hardware health, kernel-bypass networking stacks (e.g., Solarflare/Onload), and NIC configuration. Ensure the operational resiliency of all systems globally.

4. Automation & Process Improvement: Maintain an "automate everything" attitude: actively identify and reduce administrative overhead through automation. Develop, test, and deploy automation solutions using Python and Shell scripting. Provide thought leadership to identify opportunities for automated system health monitors, alerts, and self-remediating workflows. Leverage AI tools to maximize team efficiency. Experience with IaC tooling such as Terraform is desired.

5. Reporting & Observability: Create and improve operational reports related to Linux platform health, patch compliance, capacity utilization, and incident metrics (MTTR, patch %, backup success rates). Build and maintain dashboards using Grafana, Prometheus, or equivalent tooling. Analyze system telemetry data sets to troubleshoot or explain perceived performance issues.

6. Infrastructure Project Ownership: Represent the Linux Platform Operations team and take ownership of the team's portion of large infrastructure projects. Gather, organize, document, and clearly communicate project requirements to relevant teams. Work across Engineering, Network, and Security teams to execute platform initiatives end-to-end.

7. On-Call & Weekend Testing: Participate in on-call rotations to support production Linux infrastructure 24x7. Lead or participate in weekend maintenance windows, failover tests, and capacity testing exercises as part of Cboe's global operations team.

The Ideal Candidate Has:

Education

Degree in Computer Science, Computer Engineering, Software Engineering, or a related discipline (preferred, not required).

Area of Expertise and/or Skills

  • 5+ years’ experience managing a large 24x7 enterprise environment (hundreds of servers, multiple sites)
  • Deep understanding of the Linux Kernel (scheduling, networking, I/O, monitoring)
  • Solid background in configuration management (Puppet/Salt, building RPMs, deploying updates)
  • Ability to automate repetitive tasks using Python and Shell
  • Experience in both bare-metal and Cloud/Kubernetes environments
  • Experience with monitoring and observability platforms such as Prometheus and Grafana
  • Experience with IaC and DevOps tooling (Terraform, Git, CI/CD pipelines)
  • Experience with storage systems (SAN/NAS, NVMe, RAID) and capacity planning
  • Highly methodical and analytical approach to problem solving and evaluating technologies
  • Dedication to quality and attention to detail
  • Fluency in the English language, written and verbal

Additional Requirements

Work Schedule & On-Call:

  • Based on follow-the-sun coverage needs, work schedules may need to adjust during the year for daylight saving time. Shifts may start as early as 7 AM SGT, and working days will be either Monday to Friday or Wednesday to Sunday.
  • This role requires participation in on-call rotations to support 24x7 production systems.
  • Weekend maintenance windows and capacity testing events are expected periodically.

About Cboe Global Markets

Cboe Global Markets (Cboe: CBOE) is one of the world's largest exchange holding companies, offering cutting-edge trading and investment solutions to investors around the world. Cboe offers trading across a diverse range of products in multiple asset classes and geographies, including options, futures, U.S. and European equities, and FX markets. Cboe is the home of volatility trading, and the VIX Index is the world's barometer for equity market volatility.

Cboe Global Markets is an Equal Opportunity Employer.

Any communication from Cboe regarding this position will only come from a Cboe recruiter who has a @cboe.com email address or via LinkedIn Recruiter. Cboe does not use any other third-party communication tools for recruiting purposes.