Roche

Principal DSX Data Scientist

South San Francisco Full time

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. That’s what makes us Roche.

This role is in Analytical Data Science, a core function within Product Development Data Sciences (PDD) that provides strategic leadership and scientific rigor across Development at Roche. PDD Analytical Data Science teams are mobilized across the portfolio to generate data-driven insights, identify opportunities for scale, and implement impactful solutions.

PDD Analytical Data Science is recognized as a leading hub for top industry talent, operating as an agile workforce to deliver regulatory commitments across the portfolio. We identify, influence, and adopt industry-leading digital and automation solutions, develop analytical approaches to support exploratory analyses, and align statistical programming practices across both early- and late-stage clinical development.

The Opportunity

The Data Scientist in the Data Science Acceleration (DSX) team is responsible for developing scalable tools, environments, and workflows that enable efficient, high-quality statistical computing across Product Development Data Sciences (PDD). This role focuses on creating and maintaining next-generation capabilities that support automation of programming workflows, generation of reusable coding macros, and advanced data visualization. Working closely with statistical programmers, biostatisticians, and clinical scientists, the Data Scientist translates scientific and operational needs into robust, modular, and user-friendly solutions that streamline evidence generation and support faster, more reliable decision-making across the development pipeline.

    • You lead the design and development of complex components within DSX-owned statistical platforms and systems, including tools, libraries, and workflows that accelerate statistical computing and insight generation across PDD

    • You translate high-level strategic and scientific needs into scalable, production-grade technical solutions, balancing innovation, usability, and compliance

    • You guide architecture and engineering decisions for core programming infrastructure, ensuring alignment with internal standards and external best practices

    • You provide expert input on programming automation, workflow orchestration, and performance optimization across data science pipelines

    • You act as a thought leader and resource on reproducibility, software quality, and statistical computing efficiency across PDD functions

    • You lead substreams or work packages within global cross-functional projects, working in close collaboration across industry and with internal teams such as biostatistics, programming, translational science, and platform engineering

    • You serve as a mentor to less experienced colleagues, promoting best practices in software development, analytics tooling, and scientific collaboration

    • You drive evaluation and integration of emerging technologies that enhance Roche’s ability to generate reliable, timely, and scalable evidence

      Who you are:

      • You hold a PhD or Master’s degree in Computer Science, Data Science, Statistics, Bioinformatics, Engineering, or a related quantitative field

      • You have a minimum of 5 years of relevant experience in scientific software development, statistical computing, or data science, preferably in a regulated or clinical research environment

      • You have proven experience leading the design and delivery of large-scale tools, infrastructure components, or computing environments that support data analysis across functions

      • You have deep proficiency in programming languages such as Python and/or R, and fluency with best practices in software engineering (e.g., version control, testing, CI/CD, modular code)

      • You have a strong understanding of statistical programming needs in the context of clinical trials, regulatory analysis, or real-world evidence generation

      • You have demonstrated ability to lead cross-functional workstreams and influence the direction of data science tooling and strategy

      • You demonstrate capacity for independent thinking and ability to make decisions based upon sound principles

      • You bring excellent strategic agility including problem-solving and critical thinking skills, and agility that extends beyond the technical domain

      • You demonstrate respect for cultural differences when interacting with colleagues in the global workplace

      • You have excellent verbal and written communication skills, specifically in the areas of presentation and writing, with the ability to explain complex technical concepts in clear language

      Preferred:

      • Experience architecting and maintaining scalable, modular data science tools or platforms used across multiple teams or functions

      • Familiarity with infrastructure and orchestration tools such as Docker, Kubernetes, Airflow, or cloud-native environments (e.g., AWS, GCP, Azure)

      • Strong track record of advancing automation, performance, or reproducibility within statistical or scientific computing workflows

      • Proven ability to influence internal standards, coding frameworks, or platform roadmaps across a matrixed organization

      • Awareness of regulatory and data integrity considerations in the development of software tools used in clinical or translational research

      • Experience leading codebase refactoring, modernization, or integration of emerging technologies into production systems

      Relocation benefits are not available for this posting

      The expected salary range for this position based on the primary location of California is $177,300 - $329,300. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.

      Benefits

      Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including but not limited to, discrimination on the basis of Protected Veteran status, individuals with disabilities status, and consistent with all federal, state, or local laws.

      If you have a disability and need an accommodation in relation to the online application process, please contact us by completing this form Accommodations for Applicants.