AssetMark

Vice President, Head of Infrastructure Resiliency

Charlotte, NC Full time

Job Description:

As the Head of Platform Resiliency & Operations, you are accountable for operating and engineering the reliability, scalability, and resilience of AssetMark’s platform.

This role owns production operations today—including environments, batch processing, incident response, and day-to-day platform management—which are currently operationally intensive. Your mandate is to transform this reality by driving an engineering-first approach to production management and infrastructure.

You will lead a fundamental shift: from reactive, manual operations to proactive, automated, and engineered reliability—while continuing to deliver a high-quality, always-on platform for our clients.

This role has a twofold mandate:

Deliver on our client commitment by operating a high-availability, high-resiliency platform where reliability is a defining feature of the product Enable high-velocity product development by building systems, tooling, and practices that allow Product & Engineering to move fast without compromising stability

We can only consider candidates for this position who are able to accommodate a hybrid work schedule and are close to our Charlotte, NC office.

What You Will Own

1. Production Operations & Reliability Transformation

  • Own 24/7 production operations for mission-critical systems, including incident management, batch processing, and environment stability
  • Lead the transformation of production operations from manual, reactive processes to automated, engineering-driven systems
  • Establish an engineering-first mandate to eliminate manual toil and operational overhead
  • Drive systematic improvements in reliability, scalability, and operational efficiency

2. Reliability Engineering & Error Budget Management

  • Define and operationalize Service Level Indicators (SLIs) and Service Level Objectives (SLOs) across all critical systems
  • Establish and govern Error Budgets to balance product velocity with platform stability
  • Drive measurable reduction in operational toil through automation and engineering solutions
  • Embed reliability targets into planning and decision-making across teams
    • Apply Site Reliability Engineering (SRE) principles to quantify and manage reliability

3. Observability & Resilience Engineering

  • Build full-stack observability (metrics, logs, traces) to improve detection and diagnosis of issues
  • Evolve monitoring into deep observability with actionable alerting and reduced alert fatigue
  • Establish resilience testing practices (e.g., game days, fault injection)
  • Drive automated incident response and self-healing systems
  • Institutionalize blameless post-mortems focused on systemic improvement
  • Leverage SRE practices for incident learning and continuous improvement

4. Platform Engineering & Infrastructure

  • Ensure all infrastructure is managed via Infrastructure as Code (IaC) for consistency, scalability, and recovery
  • Own reliability and operational integrity of CI/CD pipelines, including automated release gating
  • Build self-service platforms and tooling that enable engineering teams to deploy and operate services safely
  • Modernize batch processing and environment management through automation and engineering rigor

5. Shared Reliability Ownership with Engineering

  • Establish shared accountability for reliability between Platform, SRE, and Software Engineering teams
  • Partner with Engineering to co-deliver reliability improvements and conduct joint post-incident reviews
  • Influence engineering practices including production readiness, safe deployments, and observability standards
  • Ensure reliability is embedded early in the software development lifecycle

6. Ecosystem & Vendor Reliability

  • Define and enforce reliability standards for third-party vendors and platform dependencies
  • Establish SLIs/SLOs for external services and manage vendor performance accordingly
  • Map and govern system dependencies to prevent cascading failures

How Success Is Measured

  • Sustained improvement in platform reliability as measured by SLO attainment
  • High availability and resiliency of client-facing systems
  • Reduction in operational toil and manual intervention across teams
  • Increased deployment velocity without degradation of reliability
  • Adoption of Infrastructure as Code and self-service platform capabilities
  • Reduction in incident frequency and improved detection (MTTD) and recovery (MTTR)
  • Demonstrated transformation from manual operations to engineering-led reliability

Who You Are

1. Engineering-First Mindset & Technical Depth

  • Strong background in Software Engineering or Systems Engineering; you lead reliability through code, not process alone
  • Deep expertise in distributed systems, failure modes, and large-scale platform architecture
  • Passionate about observability, SLOs, and data-driven reliability management

2. Proven Leadership Across Operations and Engineering

  • Experience owning production operations for mission-critical systems
  • Track record of transforming manual, operations-heavy environments into automated, engineering-led platforms
  • Experience building and scaling SRE and/or Platform Engineering capabilities
  • Strong incident leadership experience with a focus on blameless culture and systemic improvement

3. Change Leadership & Organizational Influence

  • Demonstrated ability to drive behavioral change across Engineering and Infrastructure teams
  • Experience embedding operational rigor into the software development lifecycle (SDLC)
  • Ability to balance reliability with product velocity through data-driven tradeoffs

4. Executive & Cross-Functional Leadership

  • Strong partner to Product, Engineering, and Infrastructure leadership
  • Able to communicate clearly with executives during high-pressure incidents
  • Deep understanding of reliability as a core business capability in financial services

Leadership Mandate

You are the architect of trust in AssetMark’s platform.

You will operate today’s platform with excellence while transforming it into an engineered, resilient, and scalable system. By driving an engineering-first approach to production operations, you will enable teams to move faster while improving stability—ensuring our advisors and clients can depend on the platform at all times.

This role is accountable for both running the platform and fundamentally reinventing how it is run.

Compensation: The Base Salary range for this position is between $250,000-$300,000.

 

This information reflects a base salary range that AssetMark reasonably expects to pay for the position based on a number of factors which may include job-related knowledge, skills, education, experience, and actual work location. This position will also be eligible for additional variable incentive compensation and competitive benefits.

Candidates must be legally authorized to work in the US to be considered. We are unable to provide visa sponsorship for this position.

#LI-hybrid

#LI-TN1

Who We Are & What We Offer:

We are AssetMark, a company on the move, shaping the future of financial services. Growth is in our DNA. Every day, we combine technology, insight, and collaboration to create new possibilities for advisors, for our people and for our investors. At AssetMark your ideas matter; they’re heard, valued, and drive meaningful change. Join a team that sets new standards and creates space for you to thrive and do your best work. 

Our Mission 

Our mission is simple: to help our 10,500+ financial advisors make a meaningful difference in their clients’ lives. We do this by combining powerful technology, holistic support, and expert consulting to help advisors run stronger, more efficient businesses. Backed by a comprehensive suite of investment solutions and a trust company that boasts of $150B+ AUM, our platform empowers advisors to deliver exceptional service and an outstanding client experience.

Our Values 

Heart. Client Success. Integrity. Respect. Excellence. Our values are how we show up every day.  

We believe in: 

  • Leading with Heart, in truly making a difference in the lives of others: teammates, clients, investors and communities. 

  • Obsessing over Client Success, bringing a relentless focus on what matters to clients that sets us apart and creates loyal, lasting relationships. 

  • Unyielding Integrity, doing what’s right, always. Even when it’s hard. 

  • Collective Respect, in being authentic, inclusive and valuing all voices while winning together. 

  • Operating with Excellence, in learning fast, continuously improving, innovating and collaborating to find new and better solutions.  

These values shape our culture, guide our decisions, and define what it means to be part of the AssetMark family. 

Our Culture & Benefits 

Our culture brings our mission and values to life. Here, we do what’s right, embrace diverse ideas, and innovate together. We also offer a wide range of benefits to support you and your family—because thriving at work starts with thriving in life. 

  • Flex Time or Paid Time Off and Sick Time Off 

  • 401K – 6% Employer Match 

  • Medical, Dental, Vision – HDHP or PPO 

  • HSA – Employer contribution (HDHP only) 

  • Volunteer Time Off 

  • Career Development / Recognition 

  • Fitness Reimbursement 

  • Hybrid Work Schedule 

     

As an Equal Opportunity Employer, AssetMark is committed to building a diverse and inclusive workplace where everyone feels valued.