GSK is seeking an exceptional and visionary scientist to join the Oligo Design and Data Science group as a Senior Principal Scientist. This pivotal role is at the heart of our strategy to accelerate drug discovery by leveraging massive-scale screening data and machine learning. You will be responsible for architecting and implementing the next generation of informatics for our DNA-encoded platforms. As a senior member of the team, you will play a critical role in driving our data strategy and shaping the future of hit-finding at GSK.
In this position, you will lead the design and development of critical software infrastructure, from automated ETL pipelines that process terabyte-scale sequencing data to sophisticated web applications and interactive dashboards that enable data-driven decision-making. Your expertise will be instrumental in developing and applying novel statistical methods for analyzing selection data from both small molecule and oligonucleotide libraries, building robust machine learning models to predict structure-activity relationships, and exploring deep learning approaches for hit identification. You will work in a deeply collaborative, cross-functional environment alongside experts in oligo therapeutic design, chemistry, biology, and biophysics to translate complex data into actionable hypotheses that guide our therapeutic discovery programs.
The ideal candidate will possess a PhD in computational science and a proven history of building scientific computing platforms from the ground up in a drug discovery setting. Deep expertise in cheminformatics, DNA-encoded library (DEL) data analysis, and the development of scientific applications using Python (e.g., pandas, scikit-learn, Django), SQL, and modern cloud infrastructure is essential. We are looking for a strategic thinker and a hands-on builder who is passionate about leveraging computation to solve challenging biological problems and is excited by the opportunity to have a significant impact on the future of data-driven drug discovery at GSK.
Key Responsibilities:
Drive data science initiatives to support informed decision-making in active early-stage small molecule and oligonucleotide discovery projects.
Collaborate with laboratory scientists to build data infrastructure, develop decision-making heuristics, and implement tracking systems for early discovery oligonucleotide and DEL projects, supporting workflows from initial screening through candidate selection.
Collaborate closely with research tech and AI/ML teams to architect, develop, and optimize predictive informatics platforms that enable scalable data integration, advanced statistical analytics, and actionable insights for therapeutic discovery.
Why You?
Basic Qualifications:
We are looking for professionals with these required skills to achieve our goals:
PhD in computational science, bioinformatics, cheminformatics, computer science, or a closely related discipline.
Experience in cheminformatics and DNA-encoded library (DEL) data analysis, including the application of advanced statistical and computational methods to large-scale biological datasets.
Experience developing scientific applications using Python (such as pandas, scikit-learn, Django), SQL, and deploying solutions on modern cloud infrastructure.
On-site presence of 2–3 days per week, as required for team collaboration and project delivery.
Preferred Qualifications:
If you have the following characteristics, it would be a plus:
Experience leading platform development initiatives that integrate research technology, artificial intelligence, and machine learning for scalable data analysis and informatics solutions.
Significant contributions to open-source scientific software projects or recognized achievement in computational life science competitions (e.g., Kaggle, TopCoder, DREAM Challenge).
Expertise in the design and optimization of automated ETL pipelines for processing terabyte-scale sequencing or screening data.
Advanced knowledge of predictive modeling, Bayesian statistics, and deep learning approaches for hit identification and structure-activity relationship prediction.
Demonstrated success in cross-functional communication, matrixed collaboration, and thought leadership within multidisciplinary teams.
Strong analytical and problem-solving skills, with a track record of translating complex biological questions into actionable computational solutions.
Ability to work collaboratively in cross-functional teams, communicating effectively with experts in chemistry, biology, biophysics, and data science.
Experience with analysis of siRNA knockdown screens or CRISPR knockout libraries.
Please visit the GSK US Benefits Summary to learn more about the comprehensive benefits program GSK offers US employees.
Why GSK?
Uniting science, technology and talent to get ahead of disease together.
GSK is a global biopharma company with a purpose to unite science, technology and talent to get ahead of disease together. We aim to positively impact the health of 2.5 billion people by the end of the decade, as a successful, growing company where people can thrive. We get ahead of disease by preventing and treating it with innovation in specialty medicines and vaccines. We focus on four therapeutic areas: respiratory, immunology and inflammation; oncology; HIV; and infectious diseases – to impact health at scale.
People and patients around the world count on the medicines and vaccines we make, so we’re committed to creating an environment where our people can thrive and focus on what matters most. Our culture of being ambitious for patients, accountable for impact and doing the right thing is the foundation for how, together, we deliver for patients, shareholders and our people.
If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or +1 801 567 5155 (outside US).
GSK is an Equal Opportunity Employer. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, genetic information (including family medical history), military service or any basis prohibited under federal, state or local law.
Important notice to employment businesses/agencies
GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/agency and GSK. In the absence of such written authorization, any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.
Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance with all federal and state US Transparency requirements. For more information, please visit the Centers for Medicare and Medicaid Services (CMS) website at https://openpaymentsdata.cms.gov/