Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Senior Data Engineer - Pathogen

Workable
Oxfordshire
1 month ago
Applications closed

Related Jobs

View all jobs

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

The Ellison Institute of Technology (EIT) tackles humanity’s greatest challenges by turning science and technology into impactful global solutions. Focused on areas like health, food security, sustainable agriculture, climate change, clean energy, and robotics in an era of artificial intelligence. EIT blends groundbreaking research with practical applications to deliver lasting results.

A cornerstone of EIT mission is its upcoming 300,000-square-foot research facility at the Oxford Science Park, set to open in 2027. This cutting-edge campus will feature advanced labs, an oncology and preventative care clinic, and collaborative spaces to strengthen its partnership with the University of Oxford. It will also host the Ellison Scholars, driving innovation for societal benefit.

The Pathogen Mission highlights EIT’s transformative approach, using Whole Genome Sequencing (WGS) and Oracle’s cloud technology to create a global pathogen metagenomics system. This initiative aims to improve diagnostics, provide early epidemic warnings, and guide treatments by profiling antimicrobial resistance. The goal is to deliver certified diagnostic tools for widespread use in labs, hospitals, and public health.

EIT fosters a culture of collaboration, innovation, and resilience, valuing diverse expertise to drive sustainable solutions to humanity’s enduring challenges.

We are looking for a Senior Data Engineer to join the EIT Pathogen Program and play a key role in designing, implementing, and supporting the cloud-based data platform that underpins our mission. This role offers the opportunity to shape the future of pathogen monitoring and diagnostics by building scalable, secure, and high-performance data pipelines. You'll work closely with cross-functional teams, including data architects, platform engineers, and product teams, to deliver the data infrastructure that powers advanced analytics and AI-driven solutions for global health impact.

Key Responsibilities:

  • Ensure data in the platform is acquired, processed, curated, and made accessible to scientists, digital analytics products, bioinformatics, and AI at a high standard of quality and availability
  • Ensure data access adheres to FAIR principles (Findable, Accessible, Interoperable, and Re-usable)
  • Ensure data is secured and compliant with regulatory, legal, and data sharing requirements
  • Ensure efficient, performant, and high-quality pipelines for data ingestion into the platform
  • Contribute to building data management components, including reference data management, de-identification, data curation, pathogen and technical metadata catalogues, and data access controls
  • Ensure efficient, secure, scalable, available, and performant data storage components, including genomic variant storage, clinical data stores, and clinical imaging
  • Ensure robust ingest services capable of seamlessly integrating data from distributed sequencing devices, including real-time telemetry streams
  • Ensure data is processed to enable optimal access and consumption by digital analysis products, bioinformatics pipelines, and researchers/scientists

Requirements

 Essential Knowledge, Skills and Experience:

  • Deep experience in building modern data platforms using cloud-based architectures and tools
  • Experience delivering data engineering solutions on cloud platforms, preferably Oracle OCI, AWS, or Azure
  • Proficient in Python and workflow orchestration tools such as Airflow or Prefect
  • Expert in data modeling, ETL, and SQL
  • Experience with real-time analytics from telemetry and event-based streaming (e.g., Kafka)
  • Experience managing operational data stores with high availability, performance, and scalability
  • Expertise in data lakes, lakehouses, Apache Iceberg, and data mesh architectures
  • Proven ability to build, deliver, and support modern data platforms at scale
  • Strong knowledge of data governance, data quality, and data cataloguing
  • Experience with modern database technologies, including Iceberg, NoSQL, and vector databases
  • Embraces innovation and works closely with scientists and partners to explore cutting-edge technology
  • Knowledge of master data, metadata, and reference data management
  • Understanding of Agile practices and sprint-based methodologies
  • Active contributor to knowledge sharing and collaboration

 Desirable Knowledge, Skills and Experience:

  • Familiarity with genomics and associated data standards
  • Experience with healthcare clinical data and standards such as OMOP and SNOMED
  • Familiarity with containerization tools such as Docker and Kubernetes
  • Familiarity with Git and CI/CD workflows

 Key Attributes:

  • Strong collaborator with excellent communication skills
  • Comfortable working in a fast-paced, dynamic environment
  • Eagerness to learn and cross-train in new technologies
  • Proactive and hands-on approach to exploring new tools and developing proof of concepts (POCs)

Benefits

We offer the following salary and benefits:

  • Salary:  Competitive salary on offer
  • Enhanced holiday pay
  • Pension
  • Life Assurance
  • Income Protection
  • Private Medical Insurance
  • Hospital Cash Plan
  • Therapy Services
  • Perk Box
  • Electrical Car Scheme

 Why work for EIT:

At the Ellison Institute, we believe a collaborative, inclusive team is key to our success. We are building a supportive environment where creative risks are encouraged, and everyone feels heard. Valuing emotional intelligence, empathy, respect, and resilience, we encourage people to be curious and to have a shared commitment to excellence. Join us and make an impact!

 Terms of Appointment:

  • You must have the right to work permanently in the UK with a willingness to travel as necessary.
  • You will live in, relocate to, or be within easy commuting distance of Oxford.
  • During peak periods, some longer hours may be required and some working across multiple time zones due to the global nature of the programme.

 

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

The Future of Data Science Jobs: Careers That Don’t Exist Yet

Data science has rapidly evolved into one of the most important disciplines of the 21st century. Once a niche field combining elements of statistics and computer science, it is now at the heart of decision-making across industries. Businesses, governments, and charities rely on data scientists to uncover insights, forecast trends, and build predictive models that shape strategy. In the UK, data science has become central to economic growth. From the NHS using data to improve patient outcomes to financial institutions modelling risk, the applications are endless. The UK’s thriving tech hubs in London, Cambridge, and Manchester are creating high demand for data talent, with salaries often outpacing other technology roles. Yet despite its current importance, data science is still in its infancy. Advances in artificial intelligence, quantum computing, automation, and ethics will transform what data scientists do. Many of the most vital data science jobs of the next two decades don’t exist yet. This article explores why new careers are emerging, the roles likely to appear, how current jobs will evolve, why the UK is well positioned, and how professionals can prepare now.

Seasonal Hiring Peaks for Data Science Jobs: The Best Months to Apply & Why

The UK's data science sector has matured into one of Europe's most intellectually rewarding and financially attractive technology markets, with roles spanning from junior data analysts to principal data scientists and heads of artificial intelligence. With data science positions commanding salaries from £30,000 for graduate data analysts to £140,000+ for senior principal scientists, understanding when organisations actively recruit can dramatically accelerate your career progression in this intellectually stimulating and rapidly evolving field. Unlike traditional analytical roles, data science hiring follows distinct patterns influenced by business intelligence cycles, research funding schedules, and machine learning project timelines. The sector's unique combination of mathematical rigour, business impact requirements, and cutting-edge technology adoption creates predictable hiring windows that strategic professionals can leverage to advance their careers in extracting insights from tomorrow's data. This comprehensive guide explores the optimal timing for data science job applications in the UK, examining how enterprise analytics strategies, academic research cycles, and artificial intelligence initiatives influence recruitment patterns, and why strategic timing can determine whether you join a pioneering AI research team or miss the opportunity to develop the next generation of intelligent systems.

Pre-Employment Checks for Data Science Jobs: DBS, References & Right-to-Work and more Explained

Pre-employment screening in data science reflects the discipline's unique position at the intersection of statistical analysis, machine learning innovation, and strategic business intelligence. Data scientists often have privileged access to comprehensive datasets, proprietary algorithms, and business-critical insights that form the foundation of organisational strategy and competitive positioning. The data science industry operates within complex regulatory frameworks spanning GDPR, sector-specific data protection requirements, and emerging AI governance regulations. Data scientists must demonstrate not only technical competence in statistical modelling and machine learning but also deep understanding of research ethics, data privacy principles, and the societal implications of algorithmic decision-making. Modern data science roles frequently involve analysing personally identifiable information, financial data, healthcare records, and sensitive business intelligence across multiple jurisdictions and regulatory frameworks simultaneously. The combination of analytical privilege, predictive capabilities, and strategic influence makes thorough candidate verification essential for maintaining compliance, security, and public trust in data-driven insights and automated systems.