Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Data Engineer

Allegis Global Solutions
London
1 week ago
Create job alert

Overview

GlaxoSmithKline (GSK) is a science-led global healthcare company with a special purpose: to help people do more, feel better, live longer. We are on an audacious journey to impact the health of 2.5 billion people over the next decade. Our R&D division is at the forefront of this mission, dedicated to the discovery and development of groundbreaking vaccines and medicines. We are transforming the landscape of medical research by integrating cutting-edge science and technology and harnessing the power of genetics and new data. By fostering a collaborative environment that unites the talents of our people, we are revolutionizing R&D to pre-empt and defeat diseases. Join us in our commitment to uniting science, technology, and talent to get ahead of disease together.


Position Summary

At GSK we see a world in which advanced applications of Machine Learning and AI will allow us to develop novel therapies to existing diseases and to quickly respond to emerging or changing diseases with personalized drugs, driving better outcomes at reduced cost with fewer side effects. It is an ambitious vision that will require the development of products and solutions at the cutting edge of Machine Learning and AI. We’re looking for a highly skilled Backend Engineer to help us make this vision a reality.

The ideal candidate will have a track record of shipping data products derived from complex sources, responsible for the process from conceptual data pipelines to production scale. We have a commitment to quality, so the person in this role will be able to use modern cloud tooling and techniques to deliver reliable data pipelines and continuously improve them.

This role requires a passion for solving challenging problems aligned to exciting Artificial Intelligence and Machine Learning applications. Educational or professional background in the biological sciences is a plus but is not necessary; passion to help therapies for new and existing diseases, and a pattern of continuous learning and development is mandatory. 

 

Key Responsibilities

  • Build data pipelines using modern data engineering tools on Google Cloud: Python, Spark, SQL, BigQuery, Cloud Storage.
  • Ensure data pipelines meet the specific scientific needs of data consuming applications.
  • Responsible for high quality software implementations according to best practices, including automated test suites and documentation..
  • Develop, measure, and monitor key metrics for all tools and services and consistently seek to iterate on and improve them.
  • Participate in code reviews, continuously improving personal standards as well as the wider team and product.
  • Liaise with other technical staff and data engineers in the team and across allied teams, to build an end-to-end pipeline consuming other data products.

 

Minimum Requirements

  • 2+ years of data engineering experience with a Bachelors degree in a relevant field (including computational, numerate or life sciences), or equivalent experience.
  • Cloud experience (e.g. Google Cloud preferred).
  • Strong skills with industry experience in Python and SQL.
  • Unit testing experience (e.g. pytest).
  • Knowledge of agile practices and able to perform in agile software development environments.
  • Strong experience with modern software development tools / ways of working (e.g. git/GitHub, DevOps tools for deployment).

 

Preferred Requirements

  • Demonstrated experience with biological or scientific data (e.g. genomics, transcriptomics, proteomics), or pharmaceutical industry experience.
  • Bioinformatics expertise, familiarity with large scale bioinformatics datasets.
  • Experience using Nextflow pipelines.
  • Knowledge of NLP techniques and experience of processing unstructured data, using vector stores, and approximate retrieval.
  • Familiarity with orchestration tooling (e.g. Airflow or Google Workflows).
  • Experience with AI/ML powered applications.
  • Experience with Docker or containerized applications.

 

Why GSK?

Uniting science, technology and talent to get ahead of disease together.

GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organisation where people can thrive. We prevent and treat disease with vaccines, specialty and general medicines. We focus on the science of the immune system and the use of new platform and data technologies, investing in four core therapeutic areas (infectious diseases, HIV, respiratory/ immunology and oncology).

Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a place where people feel inspired, encouraged and challenged to be the best they can be. A place where they can be themselves – feeling welcome, valued, and included. Where they can keep growing and look after their wellbeing. So, if you share our ambition, join us at this exciting moment in our journey to get Ahead Together.

 

Inclusion at GSK

GSK is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.

If you need any adjustments in the recruitment process, please get in touch with our Recruitment team () to further discuss this today.

 

Important notice to employment businesses/agencies

GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.


Please note that if you are aUS Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit GSK’s Transparency Reporting For the Record site.

Related Jobs

View all jobs

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

The Best Free Tools & Platforms to Practise Data Science Skills in 2025/26

Data science continues to be one of the most exciting, high-growth career paths in the UK and worldwide. From predicting customer behaviour to detecting fraud and driving healthcare innovations, data scientists are at the forefront of digital transformation. But breaking into the field isn’t just about having a degree. Employers are looking for candidates who can demonstrate practical data science skills — analysing datasets, building machine learning models, and presenting insights that solve real business problems. The best part? You don’t need to spend thousands on premium courses or expensive software. There are dozens of high-quality, free tools and platforms that allow you to practise data science in 2025. This guide explores the best ones to help you learn, experiment, and build portfolio-ready projects.

Top 10 Skills in Data Science According to LinkedIn & Indeed Job Postings

Data science isn’t just a buzzword — it’s the engine powering innovation in sectors across the UK, from finance and healthcare to retail and public policy. As organisations strive to turn data into insight and action, the need for well-rounded data scientists is surging. But what precise skills are employers demanding right now? Drawing on trends seen in LinkedIn and Indeed job ads, this article reveals the Top 10 data science skills sought by UK employers in 2025. You’ll get guidance on showcasing these in your CV, acing interviews, and building proof of your capabilities.

The Future of Data Science Jobs: Careers That Don’t Exist Yet

Data science has rapidly evolved into one of the most important disciplines of the 21st century. Once a niche field combining elements of statistics and computer science, it is now at the heart of decision-making across industries. Businesses, governments, and charities rely on data scientists to uncover insights, forecast trends, and build predictive models that shape strategy. In the UK, data science has become central to economic growth. From the NHS using data to improve patient outcomes to financial institutions modelling risk, the applications are endless. The UK’s thriving tech hubs in London, Cambridge, and Manchester are creating high demand for data talent, with salaries often outpacing other technology roles. Yet despite its current importance, data science is still in its infancy. Advances in artificial intelligence, quantum computing, automation, and ethics will transform what data scientists do. Many of the most vital data science jobs of the next two decades don’t exist yet. This article explores why new careers are emerging, the roles likely to appear, how current jobs will evolve, why the UK is well positioned, and how professionals can prepare now.