National AI Awards 2025Discover AI's trailblazers! Join us to celebrate innovation and nominate industry leaders.

Nominate & Attend

Senior Data Scientist

Wellcome Sanger Institute
Saffron Walden
1 week ago
Create job alert

Social network you want to login/join with:
Do you want to help us improve human health and understand life on Earth? Make your mark by shaping the future to enable or deliver life-changing science to solve some of humanity’s greatest challenges.
Senior Data Scientist

We seek a senior machine learning research scientist to join a collaborative project between the Wellcome Sanger Institute and Open Targets (targets ( This project aims to leverage datasets internally generated at the Sanger Institute and publicly available data from human cells to create foundational models for biology, enhancing our understanding of life's rules and improving health for all. You will work within an interdisciplinary team of life scientists and computer/ML scientists, with a shared objective of advancing biological research through these foundational models. This role will sit within the AI/ML Faculty group led by Dr. Mohammad Lotfollahi, and the successful candidates, across different seniority levels (senior and principal), will be responsible for delivering their portfolio of scientific research projects as part of the broader team strategy.
About the role
Your role will involve designing foundational models leveraging multi-modal readouts. This includes integrating and processing data from various sources to develop robust and versatile AI models. To achieve this, you will work with open-source software, proposing, developing, and maintaining new solutions to analyze and interpret large-scale single-cell datasets. We have access to unique data and are also in the position to generate data to train unique models. Additionally, we have substantial computational power and GPU resources to train large models efficiently.
Our teams are well-positioned to tackle this problem with experience in both generating and analyzing datasets, including millions of cells across multiple tissues and conditions (e.g., disease, healthy). This involves a detailed understanding of the training of large-scale ML models and a track record of undertaking large data-science projects.
You will be responsible for:
Independently managing and leading machine learning research projects and writing outcomes in a scientific publication for submission to journals or machine learning conferences (ICLR, ICML, CVPR, etc).
Collaborating with team members in proposing, developing, and evaluating new machine learning models that enable understanding single-cell data and its application in drug discovery.
Working with Ph.D. students and postdocs in collaborating teams on developing solutions for interdisciplinary scientific problems in biology as well as providing supervision and training to junior members of the team.
Contributing to writing scientific papers on biotechnology and biology.
Distilling your developed solutions into open-source and easy-to-install packages with documentation that facilitates the usage of your solution for downstream users, including biologists and bioinformaticians.
Presenting your research and analysis pipelines to internal and external audiences.
About You:
You will be supported in your personal and professional development and have the opportunity to lead peer-reviewed publications around using genetics and genomics approaches to guide drug discovery and present them at national and international conferences.
● Ph.D. or M.Sc. with equivalent research experience in a relevant quantitative discipline (e.g., Computer Science, Computational Biology, Genetics, Bioinformatics, Physics, Engineering, or Applied Statistics/Mathematics)
● Previous ML work experience in scientific/academic environment (RA/Internships are considered as work experience)
● Strong knowledge of Python, including core data science libraries such as Scikit-Learn, SciPy, TensorFlow, and PyTorch.
● Expertise in machine learning algorithms and frameworks, with experience in designing, training, and deploying ML models.
● Proficiency in handling and processing large datasets, including techniques for data cleaning, feature engineering, and data augmentation.
● Experience with high-performance computing environments, including the use of GPUs for training large-scale machine learning models.
● Experience in natural language processing (NLP) and training models based on transformer architectures, such as BERT and GPT.
● Familiarity with generative models such as diffusion models and flow matching.
● Knowledge of software development good practices and collaboration tools, including git-based version control, Python package management, and code reviews.
● Strong problem-solving skills with the ability to analyze complex data and derive actionable insights.
● Excellent communication skills, with the ability to explain complex machine learning algorithms and statistical methods to non-technical stakeholders.
Evidence of related work experience as a researcher in the area of Machine learning
In addition to the above technical skills, you will also have the following:
Ability to quickly understand scientific, technical, and process challenges and breakdown complex problems into actionable steps
Ability to work in a frequently changing environment with the capability to interpret management information to amend plans
Ability to prioritize, manage workload, and deliver agreed activities consistently on time
Demonstrate good networking, influencing and relationship building skills
Strategic thinking is the ability to see the ‘bigger picture'
Ability to build collaborative working relationships with internal and external stakeholders at all levels
Demonstrates inclusivity and respect for all
Relevant publication of the groups:
Lotfollahi, M ., Naghipourfar, M., Luecken, M. D., Khajavi, M., Büttner, M., Wagenstetter, M., Avsec, Ž., Gayoso, A., Yosef, N., Interlandi, M. & Others. Mapping single-cell data to reference atlases by transfer learning.

Nature Biotechnology

1–10 .
Lotfollahi, M. , Wolf, F. A. & Theis, F. J. scGen predicts single-cell perturbation responses.

Nature Methods

16, 715–721 .
Lotfollahi, M. , Rybakov, S., Hrovatin, K., Hediyeh-Zadeh, S., Talavera-López, C., Misharin, A. V. & Theis, F. J. Biologically informed deep learning to query gene programs in single cell atlases.

Nature Cell Biology .

#J-18808-Ljbffr

Related Jobs

View all jobs

Senior Data Scientist

Senior Data Scientist

Senior Data Scientist

Senior Data Scientist

Senior Data Scientist

Senior Data Scientist

National AI Awards 2025

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Present Data Science Solutions to Non-Technical Audiences: A Public Speaking Guide for Job Seekers

The ability to communicate clearly is now just as important as knowing how to build a predictive model or fine-tune a neural network. In fact, many UK data science job interviews are now designed to test your ability to explain your work to non-technical audiences—not just your technical competence. Whether you’re applying for your first data science role or moving into a lead or consultancy position, this guide will show you how to structure your presentation, simplify technical content, design effective visuals, and confidently answer stakeholder questions.

Data Science Jobs UK 2025: 50 Companies Hiring Now

Bookmark this guide—refreshed every quarter—so you always know who’s really expanding their data‑science teams. Budgets for predictive analytics, GenAI pilots & real‑time decision engines keep climbing in 2025. The UK’s National AI Strategy, tax relief for R&D & a sharp rise in cloud adoption mean employers need applied scientists, ML engineers, experiment designers, causal‑inference specialists & analytics leaders—right now. Below you’ll find 50 organisations that have advertised UK‑based data‑science vacancies or announced head‑count growth during the past eight weeks. They’re grouped into five quick‑scan categories so you can jump straight to the kind of employer—& culture—that suits you. For every company you’ll see: Main UK hub Example live or recent vacancy Why it’s worth a look (tech stack, mission, culture) Search any employer on DataScience‑Jobs.co.uk to view current ads, or set up a free alert so fresh openings land straight in your inbox.

Return-to-Work Pathways: Relaunch Your Data Science Career with Returnships, Flexible & Hybrid Roles

Returning to work after an extended break can feel like stepping into a whole new world—especially in a dynamic field like data science. Whether you paused your career for parenting, caring responsibilities or another life chapter, the UK’s data science sector now offers a variety of return-to-work pathways. From structured returnships to flexible and hybrid roles, these programmes recognise the transferable skills and resilience you’ve gained and provide mentorship, upskilling and supportive networks to ease your transition back. In this guide, you’ll discover how to: Understand the current demand for data science talent in the UK Leverage your organisational, communication and analytical skills in data science roles Overcome common re-entry challenges with practical solutions Refresh your technical knowledge through targeted learning Access returnship and re-entry programmes tailored to data science Find roles that fit around family commitments—whether flexible, hybrid or full-time Balance your career relaunch with caring responsibilities Master applications, interviews and networking specific to data science Learn from inspiring returner success stories Get answers to common questions in our FAQ section Whether you aim to return as a data analyst, machine learning engineer, data visualisation specialist or data science manager, this article will map out the steps and resources you need to reignite your data science career.