Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Principle Data Engineer

Generative Group
City of London
2 days ago
Create job alert
Overview

Our client in the Life Science industry is a startup in stealth mode backed by strong funding. They are seeking a Principal Data Engineer to lead the data and infrastructure systems powering the foundation model transforming drug development.

Responsibilities
  • Lead data and infrastructure systems powering foundation model initiatives in drug development.
  • Own data workflows end-to-end, from extraction and transformation to clean Parquet outputs for machine learning teams.
  • Collaborate closely with wet lab teams; practically understand assays and protocol development.
  • Set up cloud data infrastructure from scratch, including compute, storage, networking, and access controls.
  • Build reliable, repeatable pipelines with testing, version control, and clear documentation.
  • Maintain data quality, lineage, and monitoring; implement sound data modeling practices.
Qualifications (Requirements)
  • Principal-level data engineering experience in life sciences is essential.
  • End-to-end ownership of data workflows from extraction to machine learning-ready outputs (Parquet).
  • Hands-on familiarity with genomics data, including raw FASTQ files and Illumina sequencer outputs.
  • Experience with metabolomics data, particularly untargeted mass spectrometry.
  • Strong collaboration with wet lab teams and practical understanding of assays and protocol development.
  • Cloud data infrastructure built from scratch (compute, storage, networking, access controls).
  • Strong Python and SQL skills; proficient in data modeling, data quality, lineage, and monitoring.
  • Ability to design and maintain reliable pipelines with testing and documentation.
Preferences
  • Experience building data lakes or lakehouses and automating batch workflows (e.g., Airflow).
  • Familiarity with NGS pipelines (quality control, alignment/assembly, variant calling) and mass spectrometry data analysis.
  • Use of Infrastructure as Code (Terraform), containerization (Docker), and CI/CD for deploying data systems.
  • Prior 0-to-1 startup experience and close collaboration with ML and biology teams.
Why Join
  • Design and build cloud infrastructure and data pipelines powering distributed ML training and scalable biological data workflows—without legacy constraints.
  • Work with first-of-their-kind, multi-modal datasets to support foundation model training at AlphaFold scale; this is a builder role with deep technical ownership.
  • Join as a founding member of the engineering team with significant equity and end-to-end system ownership.
  • See your work directly enable drug discoveries that will impact millions, collaborating with world-leading scientists in microbiome research and machine learning.

Location: London - 3 days onsite
Salary: £ 80 000 - £ 120 000 plus equity


#J-18808-Ljbffr

Related Jobs

View all jobs

Principle Data Engineer

Principle Data Engineer in Nottingham - Commify

Senior Data Engineer

Data Engineer

Junior Data Scientist

Senior Data Scientist

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Why Data Science Careers in the UK Are Becoming More Multidisciplinary

Data science once meant advanced statistics, machine learning models and coding in Python or R. In the UK today, it has become one of the most in-demand professions across sectors — from healthcare to finance, retail to government. But as the field matures, employers now expect more than technical modelling skills. Modern data science is multidisciplinary. It requires not just coding and algorithms, but also legal knowledge, ethical reasoning, psychological insight, linguistic clarity and human-centred design. Data scientists are expected to interpret, communicate and apply data responsibly, with awareness of law, human behaviour and accessibility. In this article, we’ll explore why data science careers in the UK are becoming more multidisciplinary, how these five disciplines intersect with data science, and what job-seekers & employers need to know to succeed in this transformed field.

Data Science Team Structures Explained: Who Does What in a Modern Data Science Department

Data science is one of the most in-demand, dynamic, and multidisciplinary areas in the UK tech and business landscape. Organisations from finance, retail, health, government, and beyond are using data to drive decisions, automate processes, personalise services, predict trends, detect fraud, and more. To do that well, companies don’t just need good data scientists; they need teams with clearly defined roles, responsibilities, workflows, collaboration, and governance. If you're aiming for a role in data science or recruiting for one, understanding the structure of a data science department—and who does what—can make all the difference. This article breaks down the key roles, how they interact across the lifecycle of a data science project, what skills and qualifications are typical in the UK, expected salary ranges, challenges, trends, and how to build or grow an effective team.

Why the UK Could Be the World’s Next Data Science Jobs Hub

Data science is arguably the most transformative technological field of the 21st century. From powering artificial intelligence algorithms to enabling complex business decisions, data science is essential across sectors. As organisations leverage data more rapidly—from retailers predicting customer behaviour to health providers diagnosing conditions—demand for proficiency in data science continues to surge. The United Kingdom is particularly well-positioned to become a global data science jobs hub. With world-class universities, a strong tech sector, growing AI infrastructure, and supportive policy environments, the UK is poised for growth. This article delves into why the UK could emerge as a leading destination for data science careers, explores the job market’s current state, outlines future opportunities, highlights challenges, and charts what must happen to realise this vision.