Job Description:

As the Head of Platform Resiliency & Operations, you are accountable for operating and engineering the reliability, scalability, and resilience of AssetMark’s platform.

This role owns today's production operations, including environments, batch processing, incident response, and day-to-day platform management, all of which are currently operationally intensive. Your mandate is to transform this reality by driving an engineering-first approach to production management and infrastructure.

You will lead a fundamental shift: from reactive, manual operations to proactive, automated, and engineered reliability—while continuing to deliver a high-quality, always-on platform for our clients.

This role has a twofold mandate:

Deliver on our client commitment by operating a high-availability, high-resiliency platform where reliability is a defining feature of the product
Enable high-velocity product development by building systems, tooling, and practices that allow Product & Engineering to move fast without compromising stability

We can only consider candidates for this position who are able to accommodate a hybrid work schedule and are close to our Charlotte, NC office.

What You Will Own

1. Production Operations & Reliability Transformation

Own 24/7 production operations for mission-critical systems, including incident management, batch processing, and environment stability
Lead the transformation of production operations from manual, reactive processes to automated, engineering-driven systems
Establish an engineering-first mandate to eliminate manual toil and operational overhead
Drive systematic improvements in reliability, scalability, and operational efficiency

2. Reliability Engineering & Error Budget Management

Define and operationalize Service Level Indicators (SLIs) and Service Level Objectives (SLOs) across all critical systems
Establish and govern Error Budgets to balance product velocity with platform stability
Drive measurable reduction in operational toil through automation and engineering solutions
Embed reliability targets into planning and decision-making across teams
Apply Site Reliability Engineering (SRE) principles to quantify and manage reliability

3. Observability & Resilience Engineering

Build full-stack observability (metrics, logs, traces) to improve detection and diagnosis of issues
Evolve monitoring into deep observability with actionable alerting and reduced alert fatigue
Establish resilience testing practices (e.g., game days, fault injection)
Drive automated incident response and self-healing systems
Institutionalize blameless post-mortems focused on systemic improvement
Leverage SRE practices for incident learning and continuous improvement

4. Platform Engineering & Infrastructure

Ensure all infrastructure is managed via Infrastructure as Code (IaC) for consistency, scalability, and recovery
Own reliability and operational integrity of CI/CD pipelines, including automated release gating
Build self-service platforms and tooling that enable engineering teams to deploy and operate services safely
Modernize batch processing and environment management through automation and engineering rigor

5. Shared Reliability Ownership with Engineering

Establish shared accountability for reliability between Platform, SRE, and Software Engineering teams
Partner with Engineering to co-deliver reliability improvements and conduct joint post-incident reviews
Influence engineering practices including production readiness, safe deployments, and observability standards
Ensure reliability is embedded early in the software development lifecycle

6. Ecosystem & Vendor Reliability

Define and enforce reliability standards for third-party vendors and platform dependencies
Establish SLIs/SLOs for external services and manage vendor performance accordingly
Map and govern system dependencies to prevent cascading failures

How Success Is Measured

Sustained improvement in platform reliability as measured by SLO attainment
High availability and resiliency of client-facing systems
Reduction in operational toil and manual intervention across teams
Increased deployment velocity without degradation of reliability
Adoption of Infrastructure as Code and self-service platform capabilities
Reduction in incident frequency, with faster detection (MTTD) and recovery (MTTR)
Demonstrated transformation from manual operations to engineering-led reliability

Who You Are

1. Engineering-First Mindset & Technical Depth

Strong background in Software Engineering or Systems Engineering; you lead reliability through code, not process alone
Deep expertise in distributed systems, failure modes, and large-scale platform architecture
Passionate about observability, SLOs, and data-driven reliability management

2. Proven Leadership Across Operations and Engineering

Experience owning production operations for mission-critical systems
Track record of transforming manual, operations-heavy environments into automated, engineering-led platforms
Experience building and scaling SRE and/or Platform Engineering capabilities
Strong incident leadership experience with a focus on blameless culture and systemic improvement

3. Change Leadership & Organizational Influence

Demonstrated ability to drive behavioral change across Engineering and Infrastructure teams
Experience embedding operational rigor into the software development lifecycle (SDLC)
Ability to balance reliability with product velocity through data-driven tradeoffs

4. Executive & Cross-Functional Leadership

Strong partner to Product, Engineering, and Infrastructure leadership
Able to communicate clearly with executives during high-pressure incidents
Deep understanding of reliability as a core business capability in financial services

Leadership Mandate

You are the architect of trust in AssetMark’s platform.

You will operate today’s platform with excellence while transforming it into an engineered, resilient, and scalable system. By driving an engineering-first approach to production operations, you will enable teams to move faster while improving stability—ensuring our advisors and clients can depend on the platform at all times.

This role is accountable for both running the platform and fundamentally reinventing how it is run.

Compensation: The Base Salary range for this position is $250,000 to $300,000.

This information reflects a base salary range that AssetMark reasonably expects to pay for the position based on a number of factors which may include job-related knowledge, skills, education, experience, and actual work location. This position will also be eligible for additional variable incentive compensation and competitive benefits.

Candidates must be legally authorized to work in the US to be considered. We are unable to provide visa sponsorship for this position.

Who We Are & What We Offer:

We are AssetMark, a company on the move, shaping the future of financial services. Growth is in our DNA. Every day, we combine technology, insight, and collaboration to create new possibilities for advisors, for our people and for our investors. At AssetMark your ideas matter; they’re heard, valued, and drive meaningful change. Join a team that sets new standards and creates space for you to thrive and do your best work.

Our Mission

Our mission is simple: to help our 10,500+ financial advisors make a meaningful difference in their clients’ lives. We do this by combining powerful technology, holistic support, and expert consulting to help advisors run stronger, more efficient businesses. Backed by a comprehensive suite of investment solutions and a trust company that boasts $150B+ in AUM, our platform empowers advisors to deliver exceptional service and an outstanding client experience.

Our Values

Heart. Client Success. Integrity. Respect. Excellence. Our values are how we show up every day.

We believe in:

Leading with Heart, in truly making a difference in the lives of others: teammates, clients, investors and communities.
Obsessing over Client Success, bringing a relentless focus on what matters to clients that sets us apart and creates loyal, lasting relationships.
Unyielding Integrity, doing what’s right, always. Even when it’s hard.
Collective Respect, in being authentic, inclusive and valuing all voices while winning together.
Operating with Excellence, in learning fast, continuously improving, innovating and collaborating to find new and better solutions.

These values shape our culture, guide our decisions, and define what it means to be part of the AssetMark family.

Our Culture & Benefits

Our culture brings our mission and values to life. Here, we do what’s right, embrace diverse ideas, and innovate together. We also offer a wide range of benefits to support you and your family—because thriving at work starts with thriving in life.

Flex Time or Paid Time Off and Sick Time Off
401K – 6% Employer Match
Medical, Dental, Vision – HDHP or PPO
HSA – Employer contribution (HDHP only)
Volunteer Time Off
Career Development / Recognition
Fitness Reimbursement
Hybrid Work Schedule

As an Equal Opportunity Employer, AssetMark is committed to building a diverse and inclusive workplace where everyone feels valued.

Vice President, Head of Infrastructure Resiliency
