Senior Data Engineer - Pathogen

Ellison Institute of Technology Oxford
Oxford
1 week ago
Create job alert
Senior Data Engineer, EIT Pathogen Program

The Ellison Institute of Technology (EIT) tackles humanity's greatest challenges by turning science and technology into impactful global solutions. Focused on areas such as health, food security, sustainable agriculture, climate change, clean energy, and robotics in an era of artificial intelligence, EIT blends groundbreaking research with practical applications to deliver lasting results.


A cornerstone of EIT's mission is its upcoming 300,000-square‑foot research facility at the Oxford Science Park, set to open in 2027. This campus will feature advanced labs, an oncology and preventative care clinic, and collaborative spaces to strengthen its partnership with the University of Oxford. It will also host the Ellison Scholars, driving innovation for societal benefit.


The Pathogen Mission highlights EIT's transformative approach, using Whole Genome Sequencing (WGS) and Oracle's cloud technology to create a global pathogen metagenomics system. This initiative aims to improve diagnostics, provide early epidemic warnings, and guide treatments by profiling antimicrobial resistance. The goal is to deliver certified diagnostic tools for widespread use in labs, hospitals, and public health.


EIT fosters a culture of collaboration, innovation, and resilience, valuing diverse expertise to drive sustainable solutions to humanity's enduring challenges.


Key Responsibilities

  • Ensure data in the platform is acquired, processed, curated, and made accessible to scientists, digital analytics products, bioinformatics, and AI at a high standard of quality and availability.
  • Ensure data access adheres to FAIR principles (Findable, Accessible, Interoperable, and Re‑usable).
  • Ensure data is secured and compliant with regulatory, legal, and data sharing requirements.
  • Ensure efficient, performant, and high‑quality pipelines for data ingestion into the platform.
  • Contribute to building data management components, including reference data management, de‑identification, data curation, pathogen and technical metadata catalogues, and data access controls.
  • Ensure efficient, secure, scalable, available, and performant data storage components, including genomic variant storage, clinical data stores, and clinical imaging.
  • Ensure robust ingest services capable of seamlessly integrating data from distributed sequencing devices, including real‑time telemetry streams.
  • Ensure data is processed to enable optimal access and consumption by digital analysis products, bioinformatics pipelines, and researchers/scientists.

Requirements
Essential Knowledge, Skills and Experience

  • Deep experience in building modern data platforms using cloud‑based architectures and tools.
  • Experience delivering data engineering solutions on cloud platforms, preferably Oracle OCI, AWS, or Azure.
  • Proficient in Python and workflow orchestration tools such as Airflow or Prefect.
  • Expert in data modeling, ETL, and SQL.
  • Experience with real‑time analytics from telemetry and event‑based streaming (e.g., Kafka).
  • Experience managing operational data stores with high availability, performance, and scalability.
  • Expertise in data lakes, lakehouses, Apache Iceberg, and data mesh architectures.
  • Proven ability to build, deliver, and support modern data platforms at scale.
  • Strong knowledge of data governance, data quality, and data cataloguing.
  • Experience with modern database technologies, including Iceberg, NoSQL, and vector databases.
  • Embraces innovation and works closely with scientists and partners to explore cutting‑edge technology.
  • Knowledge of master data, metadata, and reference data management.
  • Understanding of Agile practices and sprint‑based methodologies.
  • Active contributor to knowledge sharing and collaboration.

Desirable Knowledge, Skills and Experience

  • Familiarity with genomics and associated data standards.
  • Experience with healthcare clinical data and standards such as OMOP and SNOMED.
  • Familiarity with containerization tools such as Docker and Kubernetes.
  • Familiarity with Git and CI/CD workflows.

Key Attributes

  • Strong collaborator with excellent communication skills.
  • Comfortable working in a fast‑paced, dynamic environment.
  • Eagerness to learn and cross‑train in new technologies.
  • Proactive and hands‑on approach to exploring new tools and developing proof of concepts (POCs).

Benefits

  • Competitive salary on offer.
  • Enhanced holiday pay.
  • Pension.
  • Life Assurance.
  • Income Protection.
  • Private Medical Insurance.
  • Hospital Cash Plan.
  • Therapy Services.
  • Perk Box.
  • Electric Car Scheme.

Why work for EIT

At the Ellison Institute, we believe a collaborative, inclusive team is key to our success. We are building a supportive environment where creative risks are encouraged and everyone feels heard. Valuing emotional intelligence, empathy, respect, and resilience, we encourage people to be curious and to have a shared commitment to excellence. Join us and make an impact!


Terms of Appointment

  • You must have the right to work permanently in the UK with a willingness to travel as necessary.
  • You will live in, relocate to, or be within easy commuting distance of Oxford.
  • During peak periods, some longer hours may be required and some working across multiple time zones due to the global nature of the programme.


#J-18808-Ljbffr

Related Jobs

View all jobs

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

The Skills Gap in Data Science Jobs: What Universities Aren’t Teaching

Data science has become one of the most visible and sought-after careers in the UK technology market. From financial services and retail to healthcare, media, government and sport, organisations increasingly rely on data scientists to extract insight, guide decisions and build predictive models. Universities have responded quickly. Degrees in data science, analytics and artificial intelligence have expanded rapidly, and many computer science courses now include data-focused pathways. And yet, despite the volume of graduates entering the market, employers across the UK consistently report the same problem: Many data science candidates are not job-ready. Vacancies remain open. Hiring processes drag on. Candidates with impressive academic backgrounds fail interviews or struggle once hired. The issue is not intelligence or effort. It is a persistent skills gap between university education and real-world data science roles. This article explores that gap in depth: what universities teach well, what they often miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in data science.

Data Science Jobs for Career Switchers in Their 30s, 40s & 50s (UK Reality Check)

Thinking about switching into data science in your 30s, 40s or 50s? You’re far from alone. Across the UK, businesses are investing in data science talent to turn data into insight, support better decisions and unlock competitive advantage. But with all the hype about machine learning, Python, AI and data unicorns, it can be hard to separate real opportunities from noise. This article gives you a practical, UK-focused reality check on data science careers for mid-life career switchers — what roles really exist, what skills employers really hire for, how long retraining typically takes, what UK recruiters actually look for and how to craft a compelling career pivot story. Whether you come from finance, marketing, operations, research, project management or another field entirely, there are meaningful pathways into data science — and age itself is not the barrier many people fear.

How to Write a Data Science Job Ad That Attracts the Right People

Data science plays a critical role in how organisations across the UK make decisions, build products and gain competitive advantage. From forecasting and personalisation to risk modelling and experimentation, data scientists help translate data into insight and action. Yet many employers struggle to attract the right data science candidates. Job adverts often generate high volumes of applications, but few applicants have the mix of analytical skill, business understanding and communication ability the role actually requires. At the same time, experienced data scientists skip over adverts that feel vague, inflated or misaligned with real data science work. In most cases, the issue is not a lack of talent — it is the quality and clarity of the job advert. Data scientists are analytical, sceptical of hype and highly selective. A poorly written job ad signals unclear expectations and immature data practices. A well-written one signals credibility, focus and serious intent. This guide explains how to write a data science job ad that attracts the right people, improves applicant quality and positions your organisation as a strong data employer.