Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Principle Data Engineer

Generative Group
City of London
2 weeks ago
Create job alert
Overview

Our client in the Life Science industry is a startup in stealth mode backed by strong funding. They are seeking a Principal Data Engineer to lead the data and infrastructure systems powering the foundation model transforming drug development.

Responsibilities
  • Lead data and infrastructure systems powering foundation model initiatives in drug development.
  • Own data workflows end-to-end, from extraction and transformation to clean Parquet outputs for machine learning teams.
  • Collaborate closely with wet lab teams; practically understand assays and protocol development.
  • Set up cloud data infrastructure from scratch, including compute, storage, networking, and access controls.
  • Build reliable, repeatable pipelines with testing, version control, and clear documentation.
  • Maintain data quality, lineage, and monitoring; implement sound data modeling practices.
Qualifications (Requirements)
  • Principal-level data engineering experience in life sciences is essential.
  • End-to-end ownership of data workflows from extraction to machine learning-ready outputs (Parquet).
  • Hands-on familiarity with genomics data, including raw FASTQ files and Illumina sequencer outputs.
  • Experience with metabolomics data, particularly untargeted mass spectrometry.
  • Strong collaboration with wet lab teams and practical understanding of assays and protocol development.
  • Cloud data infrastructure built from scratch (compute, storage, networking, access controls).
  • Strong Python and SQL skills; proficient in data modeling, data quality, lineage, and monitoring.
  • Ability to design and maintain reliable pipelines with testing and documentation.
Preferences
  • Experience building data lakes or lakehouses and automating batch workflows (e.g., Airflow).
  • Familiarity with NGS pipelines (quality control, alignment/assembly, variant calling) and mass spectrometry data analysis.
  • Use of Infrastructure as Code (Terraform), containerization (Docker), and CI/CD for deploying data systems.
  • Prior 0-to-1 startup experience and close collaboration with ML and biology teams.
Why Join
  • Design and build cloud infrastructure and data pipelines powering distributed ML training and scalable biological data workflows—without legacy constraints.
  • Work with first-of-their-kind, multi-modal datasets to support foundation model training at AlphaFold scale; this is a builder role with deep technical ownership.
  • Join as a founding member of the engineering team with significant equity and end-to-end system ownership.
  • See your work directly enable drug discoveries that will impact millions, collaborating with world-leading scientists in microbiome research and machine learning.

Location: London - 3 days onsite
Salary: £ 80 000 - £ 120 000 plus equity


#J-18808-Ljbffr

Related Jobs

View all jobs

Principle Data Engineer

Senior Data Engineer

Naimuri - Data Engineer

Naimuri - Data Engineer

Senior/Lead Data Engineer

Principle Data Architect - Edinburgh, Not Office Based (UK)

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Data Science Recruitment Trends 2025 (UK): What Job Seekers Need To Know About Today’s Hiring Process

Summary: UK data science hiring has shifted from title‑led CV screens to capability‑driven assessments that emphasise rigorous problem framing, high‑quality analytics & modelling, experiment/causality, production awareness (MLOps), governance/ethics, and measurable product or commercial impact. This guide explains what’s changed, what to expect in interviews & how to prepare—especially for product/data scientists, applied ML scientists, decision scientists, econometricians, growth/marketing analysts, and ML‑adjacent data scientists supporting LLM/AI products. Who this is for: Product/decision/data scientists, applied ML scientists, econometrics & causal inference specialists, experimentation leads, analytics engineers crossing into DS, ML generalists with strong statistics, and data scientists collaborating with platform/MLOps teams in the UK.

Why Data Science Careers in the UK Are Becoming More Multidisciplinary

Data science once meant advanced statistics, machine learning models and coding in Python or R. In the UK today, it has become one of the most in-demand professions across sectors — from healthcare to finance, retail to government. But as the field matures, employers now expect more than technical modelling skills. Modern data science is multidisciplinary. It requires not just coding and algorithms, but also legal knowledge, ethical reasoning, psychological insight, linguistic clarity and human-centred design. Data scientists are expected to interpret, communicate and apply data responsibly, with awareness of law, human behaviour and accessibility. In this article, we’ll explore why data science careers in the UK are becoming more multidisciplinary, how these five disciplines intersect with data science, and what job-seekers & employers need to know to succeed in this transformed field.

Data Science Team Structures Explained: Who Does What in a Modern Data Science Department

Data science is one of the most in-demand, dynamic, and multidisciplinary areas in the UK tech and business landscape. Organisations from finance, retail, health, government, and beyond are using data to drive decisions, automate processes, personalise services, predict trends, detect fraud, and more. To do that well, companies don’t just need good data scientists; they need teams with clearly defined roles, responsibilities, workflows, collaboration, and governance. If you're aiming for a role in data science or recruiting for one, understanding the structure of a data science department—and who does what—can make all the difference. This article breaks down the key roles, how they interact across the lifecycle of a data science project, what skills and qualifications are typical in the UK, expected salary ranges, challenges, trends, and how to build or grow an effective team.