Dimensional

Manager, Site Reliability Engineering

Austin Full time

Job Description:

Dimensional Fund Advisors is a global investment firm guided by deep convictions about the power of capital markets. We are a leader in applying advanced financial science to equity and fixed income investment strategies. By employing a rigorous and systematic investment approach, we seek to capture what the market offers in all its dimensions. 
 
For more than 35 years, we have translated research into real-world investment solutions for clients. Headquartered in Austin, Texas, with 13 offices around the world. 
 

Job Description and Purpose: 

We are seeking a strategic, hands-on Manager of Site Reliability Engineering to lead a global team of SREs and ensure the stability and performance of our data platforms. You will be responsible for increasing the breadth of SRE coverage and deepening the technical expertise of your team on the solutions they support. You will partner closely with engineering, data stewardship, infrastructure, platform engineering, and executives of all levels to ensure the reliable delivery of business-critical data products. This role requires a unique blend of technical proficiency and strong leadership skills to navigate cross-functional negotiations and drive a culture of reliability and continuous improvement. 

  You may be a fit for this role if you:  

  • Are open-minded, curious, and resourceful.  

  • Lead with vision and purpose to bring about transformational change.  

  • Are passionate about/stay current with modern technologies/solutions.  

  • Solve problems systematically and transparently.  

  • Share ideas, solicit/integrate feedback, design and solve collaboratively.  

  • Demonstrate automation and security mindsets.  

What you might work on:

  • Team Leadership & Development: Manage a global team of SREs, driving professional growth and operational excellence through coaching and mentorship. 

  • Service Reliability & Health: Own our monitoring strategy and keep the team apprised of our service’s health and performance indicators through dashboarding and alerting. This includes regular benchmarking and maintaining timestamped, attributed notes regarding changes in expectations. 

  • Strategic Planning: Lead infrastructure capacity planning and headroom management for the team and its infrastructure to ensure we scale effectively. 

  • Error Budgets & Service Levels: Collaborate with product and engineering teams to negotiate and manage error budgets, SLOs and SLIs. 

  • Process & Standardization: Drive the standardization of approaches, logging practices, and observability across the organization.  

  • Documentation: Develop a strategy for intuitively navigable documentation and oversee its implementation, ensuring all our existing and future products are sufficiently covered. 

  • Cross-Functional Collaboration: Act as the primary liaison between SRE, TPMs, DevOps, development teams and business stakeholders. Negotiate investments in our solutions to enhance their supportability. 

  • Automation: Relentlessly pursue opportunities to eradicate toil through automation. 

  • Quality Assurance: Build confidence in deployments through enhanced data quality assurance processes, consistently coordinated deployments and automated testing. 

  • Incident Management: Lead the debugging, troubleshooting, diagnosing, and resolving incidents, ensuring rapid response and effective post-mortems.

Qualifications & Technical Skills 

  • Observability: Deep expertise in ELK, Prometheus, and Grafana. 

  • Programming & Development: Proficiency in Python-based service development, Linux administration, and CI/CD. 

  • Data Engineering: Experience with data flows using Airflow, dbt and Snowflake. 

  • Testing: Capability to write and run automated tests. 

  • Project Delivery: Experience running software projects from ideation through design, implementation, deployment and operations. 

  • Soft Skills: Demonstrated ability to be self-organized and self-driven with strong communication skills to influence cross-functional partners at all levels. 

Scope & Direct Reports:

Direct supervision of a global team of SREs

#LI-Hybrid

    

Dimensional offers a variety of programs to help take care of you, your family, and your career, including comprehensive benefits, educational initiatives, and special celebrations of our history, culture, and growth.

It is the policy of the Company to provide equal opportunity for all employees and applicants.  The Company recruits, hires, trains, promotes, compensates, and administers all personnel actions without regard to actual or perceived race, color, religion, religious practice, creed, sex, sex stereotyping, pregnancy (which includes pregnancy, childbirth, and medical conditions related to pregnancy, childbirth, or breastfeeding), caregiver status, gender, gender identity, gender expression, transgender identity, national origin, age, mental or physical disability, ancestry, medical condition, marital status, familial status, domestic partnership status, military or veteran status or service, unemployment status, citizenship status or alienage, sexual orientation, status as a victim of domestic violence, status as a victim of stalking, status as a victim of sex offenses, genetic information, political activities or recreational activities, arrest or conviction record, salary history, natural hairstyle or any other status protected by applicable law except as otherwise required or permitted by law or regulation applicable to the Company or its affiliates.