Data Engineer (Healthcare Data)

Pangaea Data Limited
London
1 month ago
Create job alert

As Data Engineer you will join Pangaea’s team to design and develop integrated applications for its PALLUX platform.

London Technical

Pangaea Data (Pangaea) is a South San Francisco and London based business founded by Dr Vibhor Gupta and Prof Yike Guo (Director Data Science Institute at Imperial College London; Provost, Hong Kong University of Science and Technology). They have worked in medicine and computing for over 20 years and have raised over $300 million through their academic research, including a $110 million grant focused on development work on large language models in medicine. Pangaea’s AI platform, PALLUX, is configured on clinical guidelines to find more untreated (undiagnosed, miscoded, at-risk) and under-treated patients with hard-to-diagnose conditions for screening and treatment at the point of care. Pangaea’s advisors include industry veterans from healthcare and the life sciences, including Lord David Prior (former chairman, NHS England) and Mr. Andy Palmer (former CIO, Novartis).

The Role

As Data Engineer (Healthcare Data), you will join Pangaea’s team to lead and support the development of reliable, scalable, and secure data solutions. The ideal candidate will be experienced with healthcare data standards (e.g. FHIR, OMOP), possess a strong understanding of data privacy regulations (e.g., HIPAA, GDPR), and have technical expertise to design and implement data pipelines, storage systems, and integrations.

This role will continue to evolve as the business grows, but in the short term it will also involve development of the software product and collaboration with the clinical and scientific team. A strong software engineering background and knowledge in AI, especially Machine Learning and Natural Language Processing, is essential. For the right candidate, this is a senior technical position with scope to grow into a leadership role.

Key technical responsibilities will include:

  • Design, implement, and maintain ETL pipelines to collect, clean, and transform healthcare data from various sources such as EHR systems, APIs, and databases.
  • Ensure data quality and integrity through robust testing and validation processes.
  • Optimize storage solutions for structured and unstructured healthcare data using databases (e.g., MongoDB) and cloud-based data warehouses (e.g., Azure Cosmos, Azure Fabric).
  • Maintain strict compliance with data privacy regulations such as HIPAA, GDPR, and other local healthcare policies.
  • Work closely with the clinical team to understand data requirements and translate them into technical solutions.
  • Collaborate with the AI team to provide clean, well-structured datasets for research, and AI/ML models.
  • Stay up-to-date with the latest data engineering technologies and best practices.

Mandatory Requirements

Technical skills:

  • A university qualification (Bachelors, Masters, Doctorate) with at least two years of university study in Computer Science, Informatics, Data Science, Engineering, or related fields.
  • Experience in data engineering, with a focus on healthcare data preferred.
  • Familiarity with NoSQL databases (e.g., MongoDB) and relational databases (e.g., PostgreSQL, MySQL).
  • 5+ years in Python and SQL work.
  • Knowledge of ETL tools (e.g., Apache Airflow) and cloud platforms (e.g., AWS, Azure, GCP).
  • Understand data modelling concepts and best practices. Experience with healthcare data standards (e.g., HL7, FHIR, ICD, SNOMED, DICOM) preferred.
  • Excellent problem-solving and communication skills.
  • Ability to communicate complex ideas effectively, both verbally and in writing.
  • Ability to engage all levels of the company and the customers’ organizations.
  • Ability to work collaboratively in a team environment.

Nice to Have

  • 3-5 years experience of managing teams.
  • Experience working on large-scale, commercial software development projects is a plus.
  • Experience with research communities and/or efforts, including having published papers (being listed as author) at AI/ML/NLP/CV conferences (e.g. Bio-IT, NeuraIPS, ICML, ICLR, ACL, CVPR and KDD) and journals.
  • Experience and knowledge of deploying AI and Data solutions for healthcare and pharmaceuticals at scale is desirable.

Perks and Benefits

  • Salary dependent on experience.
  • Package of attractive benefits including private medical insurance and monthly travel card.
  • You will join a dedicated highly renowned team offering you the opportunity to grow and develop your professional skills and profile.
  • You will have the opportunity to learn about building a startup business from experienced professionals and serial entrepreneurs.

Application Contact Information

Your application should include a CV and cover letter highlighting your relevant experiences and motivations. Please send this to .

General Information

Pangaea Data is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity or expression, religion, national origin or ancestry, age, disability, marital status, pregnancy, protected veteran status, protected genetic information, political affiliation, or any other characteristics protected by local laws, regulations, or ordinances.

#J-18808-Ljbffr

Related Jobs

View all jobs

Data Engineer

Data Engineer

Data Engineer - UK Perm - London Hrbrid

Oracle Java8 Scala Spring Data Engineer London £575d insideIR35

Data Engineer, DE55

Data Engineer - Databricks

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Top 10 Data Science Career Myths Debunked: Key Facts for Aspiring Professionals

Data science has become one of the most sought-after fields in the tech world, promising attractive salaries, cutting-edge projects, and the opportunity to shape decision-making in virtually every industry. From e-commerce recommendation engines to AI-powered medical diagnostics, data scientists are the force behind innovations that drive productivity and improve people’s lives. Yet, despite the demand and glamour often associated with this discipline, data science is also shrouded in misconceptions. Some believe you need a PhD in mathematics or statistics; others assume data science is exclusively about machine learning or coding. At DataScience-Jobs.co.uk, we’ve encountered a wide array of myths that can discourage talented individuals or mislead those exploring a data science career. This article aims to bust the top 10 data science career myths—providing clarity on what data scientists actually do and illuminating the true diversity and inclusiveness of this exciting field. Whether you’re a recent graduate, a professional looking to pivot, or simply curious about data science, read on to discover the reality behind the myths.

Global vs. Local: Comparing the UK Data Science Job Market to International Landscapes

How to evaluate salaries, opportunities, and work culture in data science across the UK, the US, Europe, and Asia Data science has proven to be more than a passing trend; it is now a foundational pillar of modern decision-making in virtually every industry—from healthcare and finance to retail and entertainment. As the volume of data grows exponentially, organisations urgently need professionals who can transform raw information into actionable insights. This high demand has sparked a wave of new opportunities for data scientists worldwide. In this article, we’ll compare the UK data science job market to those in the United States, Europe, and Asia. We’ll explore hiring trends, salary benchmarks, and cultural nuances to help you decide whether to focus your career locally or consider opportunities overseas or in fully remote roles. Whether you’re a fresh graduate looking for your first data science position, an experienced data professional pivoting from analytics, or a software engineer eager to break into machine learning, understanding the global data science landscape can be a game-changer. By the end of this overview, you’ll be better equipped to navigate the expanding world of data science—knowing which skills and certifications matter most, how salaries differ between regions, and what to expect from distinct work cultures. Let’s dive in.

Data Science Leadership for Managers: Strategies to Motivate, Mentor, and Set Realistic Goals in Data-Driven Teams

Data science has become a linchpin in modern business, transforming oceans of raw data into actionable insights that guide strategy, product development, and personalised customer experiences. With this surge in data-centric operations, the need for effective data science leadership has never been more critical. Guiding a team of data scientists, analysts, and machine learning engineers requires not only technical acumen but also the ability to foster collaboration, champion ethical practices, and align complex modelling efforts with overarching business goals. This article provides practical guidance for managers and aspiring leaders aiming to excel in data-driven environments. By exploring strategies to motivate data science professionals, develop mentoring frameworks, and set achievable milestones, you will be better prepared to steer your team towards meaningful, evidence-based outcomes.