Graduate Data Engineer

SRG
Marlow
1 month ago
Create job alert

SRG are working with a leading pharmaceutical company based in Marlow. Our client develops and manufacture an impressive portfolio of aesthetics brands and products. Our client is committed to driving innovation and providing high-quality products and services.


Role Overview

As a Graduate Data Engineer, you will build and maintain scalable data pipelines in Palantir Foundry for advanced reporting and analytics while collaborating with cross-functional teams as part of the BTS Data & Analytics team. You will work closely with key stakeholders in Engineering, Product, GTM, and other groups to help build scalable data solutions that support key metrics, reporting, and insights. You will assist in ensuring teams have access to reliable, accurate data as our company grows. You will have the opportunity to support projects that enable self-serve insights, helping teams make data-driven decisions, while learning from experienced team members and developing your technical and business skills.


Key Responsibilities

  • Build and maintain data pipelines, leveraging PySpark and/or Typescript within Foundry, to transform raw data into reliable, usable datasets. Familiarity with Palantir Foundry, PySpark, Kafka, TypeScript, PowerBI preferable.
  • Assist in preparing and optimizing data pipelines to support machine learning and AI model development, ensuring datasets are clean, well-structured, and readily usable by Data Science teams.
  • Support the integration and management of feature engineering processes and model outputs into Foundry's data ecosystem, helping enable scalable deployment and monitoring of AI/ML solutions as you develop your skills in this area.
  • Engaged in gathering and translating stakeholder requirements for key data models and reporting, with a focus on Palantir Foundry workflows and tools.
  • Participate in developing and refining dashboards and reports in Foundry to visualize key metrics and insights as you grow your data visualization skills.
  • Collaborate with Product, Engineering, and GTM teams to align data architecture and solutions, learning to support scalable, self-serve analytics across the organization.
  • Have some prompt engineering experience with large language models, including writing and evaluating complex multi-step prompts.
  • Continuously develop your understanding of the company’s data landscape, including Palantir Foundry’s ontology-driven approach and best practices for data management.

About you

  • You have a degree in Computer Science, Engineering, Mathematics, or similar, or have similar work experience.
  • Having up to 2 years of experience building data pipelines at work or through internships is helpful.
  • You can write clear and reliable Python/PySpark code.
  • You are familiar with popular analytics tools (like pandas, numpy, matplotlib), big data frameworks (like Spark), and cloud services (like Palantir, AWS, Azure, or Google Cloud).
  • You have a deep understanding of data models, relational and non-relational databases, and how they are used to organize, store, and retrieve data efficiently for analytics and machine learning.
  • Knowing about software engineering methods, including DevOps, DataOps, or MLOps, is also a plus.

You will be considered a strong fit if you have

  • Master’s degree in engineering (such as AI/ML, Data Systems, Computer Science, Mathematics, Biotechnology, Physics), or minimum 2 years of relevant technology experience.
  • Experience with Generative AI (GenAI) and agentic systems will be considered a strong plus.
  • Have a proactive and adaptable mindset: willing to take initiative, learn new skills, and contribute to different aspects of a project as needed to drive solutions from start to finish, even beyond the formal job description.
  • Show a strong ability to thrive in situations of ambiguity, taking initiative to create clarity for yourself and the team, and proactively driving progress even when details are uncertain or evolving.
  • Hybrid working policy: Currently, our client expects all staff to be in their Marlow-based office at least 3 days a week from Jan 2026.

Seniority level

Entry level


Employment type

Full-time


Job function

Information Technology


Industries

Pharmaceutical Manufacturing


#J-18808-Ljbffr

Related Jobs

View all jobs

Graduate Data Engineer

Graduate Data Engineer - Hybrid, Insights & Automation

Graduate Data Engineer

Graduate Data Engineer

Graduate Data Engineer

Graduate Data Engineer: Azure Data Platform Explorer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How Many Data Science Tools Do You Need to Know to Get a Data Science Job?

If you’re trying to break into data science — or progress your career — it can feel like you are drowning in names: Python, R, TensorFlow, PyTorch, SQL, Spark, AWS, Scikit-learn, Jupyter, Tableau, Power BI…the list just keeps going. With every job advert listing a different combination of tools, many applicants fall into a trap: they try to learn everything. The result? Long tool lists that sound impressive — but little depth to back them up. Here’s the straight-talk version most hiring managers won’t explicitly tell you: 👉 You don’t need to know every data science tool to get hired. 👉 You need to know the right ones — deeply — and know how to use them to solve real problems. Tools matter, but only in service of outcomes. So how many data science tools do you actually need to know to get a job? For most job seekers, the answer is not “27” — it’s more like 8–12, thoughtfully chosen and well understood. This guide explains what employers really value, which tools are core, which are role-specific, and how to focus your toolbox so your CV and interviews shine.

What Hiring Managers Look for First in Data Science Job Applications (UK Guide)

If you’re applying for data science roles in the UK, it’s crucial to understand what hiring managers focus on before they dive into your full CV. In competitive markets, recruiters and hiring managers often make their first decisions in the first 10–20 seconds of scanning an application — and in data science, there are specific signals they look for first. Data science isn’t just about coding or statistics — it’s about producing insights, shipping models, collaborating with teams, and solving real business problems. This guide helps you understand exactly what hiring managers look for first in data science applications — and how to structure your CV, portfolio and cover letter so you leap to the top of the shortlist.

The Skills Gap in Data Science Jobs: What Universities Aren’t Teaching

Data science has become one of the most visible and sought-after careers in the UK technology market. From financial services and retail to healthcare, media, government and sport, organisations increasingly rely on data scientists to extract insight, guide decisions and build predictive models. Universities have responded quickly. Degrees in data science, analytics and artificial intelligence have expanded rapidly, and many computer science courses now include data-focused pathways. And yet, despite the volume of graduates entering the market, employers across the UK consistently report the same problem: Many data science candidates are not job-ready. Vacancies remain open. Hiring processes drag on. Candidates with impressive academic backgrounds fail interviews or struggle once hired. The issue is not intelligence or effort. It is a persistent skills gap between university education and real-world data science roles. This article explores that gap in depth: what universities teach well, what they often miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in data science.