Senior Data Engineer

Starcom
City of London
1 month ago
Create job alert
Overview

This role presents an opportunity to engage deeply with MLOps, vector databases, and Retrieval-Augmented Generation (RAG) pipelines - skills that are in incredibly high demand. If you are passionate about shaping the future of AI and thrive on complex, high-impact challenges, we encourage you to apply.

Responsibilities
  • Design and Build Scalable Data Pipelines: Architect, implement, and optimize robust, high-performance real-time and batch ETL pipelines to ingest, process, and transform massive datasets for LLMs and foundational AI models.
  • Cloud-Native Innovation: Leverage your deep expertise across AWS, Azure, and/or GCP to build cloud-native data solutions, ensuring efficiency, scalability, and cost-effectiveness.
  • Power Generative AI: Develop and manage specialized data flows for generative AI applications, including integrating with vector databases and constructing sophisticated RAG pipelines.
  • Champion Data Governance & Ethical AI: Implement best practices for data quality, lineage, privacy, and security, ensuring our AI systems are developed and used responsibly and ethically.
  • Tooling the Future: Get hands-on with cutting-edge technologies like Hugging Face, PyTorch, TensorFlow, Apache Spark, Apache Airflow, and other modern data and ML frameworks.
  • Collaborate and Lead: Partner closely with ML Engineers, Data Scientists, and Researchers to understand their data needs, provide technical leadership, and translate complex requirements into actionable data strategies.
  • Optimize and Operate: Monitor, troubleshoot, and continuously optimize data pipelines and infrastructure for peak performance and reliability in production environments.
What You'll Bring

We are seeking a seasoned professional who is excited by the unique challenges of AI data.

QualificationsWhat are we looking for?Must-Have Skills
  • Extensive Data Engineering Experience: 3+ years designing, building, and maintaining large-scale data pipelines and data warehousing solutions.
  • Cloud Platform Mastery: Expert-level proficiency with at least one major cloud provider (GCP preferred, AWS, or Azure), including their data, compute, and storage services.
  • Programming Prowess: Strong programming skills in Python and SQL.
  • Big Data Ecosystem Expertise: Hands-on experience with Apache Spark, Kafka, and data orchestration tools such as Apache Airflow or Prefect.
  • ML Data Acumen: Solid understanding of data requirements for machine learning models, including feature engineering, data validation, and dataset versioning.
  • Vector Database Experience: Practical experience with vector databases (e.g., Pinecone, Milvus, Chroma) for embedding storage and retrieval.
  • Generative AI Familiarity: Understanding of data paradigms for LLMs, RAG architectures, and how data pipelines support fine-tuning or pre-training.
  • MLOps Principles: Familiarity with MLOps best practices for deploying and managing ML models in production.
  • Data Governance & Ethics: Experience implementing data governance frameworks, ensuring data quality, privacy, and compliance, with awareness of ethical AI considerations.
Bonus Points If You Have
  • Direct experience with Hugging Face ecosystem, PyTorch, or TensorFlow for data preparation in an ML context.
  • Experience with real-time data streaming architectures.
  • Familiarity with containerization (Docker, Kubernetes).
  • Master's or Ph.D. in Computer Science, Data Engineering, or a related quantitative field.
Additional Information

Starcom has fantastic benefits on offer to all of our employees. In addition to the classics, Pension, Life Assurance, Private Medical and Income Protection Plans, we also offer:

  • WORK YOUR WORLD opportunity to work anywhere in the world, where there is a Publicis office, for up to 6 weeks a year.
  • REFLECTION DAYS - Two additional days of paid leave to step away from your usual day-to-day work and create time to focus on your well-being and self-care.
  • HELP@HAND BENEFITS 24/7 helpline to support you on a personal and professional level. Access to remote GPs, mental health support and CBT. Wellbeing content and lifestyle coaching.
  • FAMILY FRIENDLY POLICIES We provide 26 weeks of full pay for the following family milestones: Maternity, Adoption, Surrogacy and Shared Parental Leave.
  • FLEXIBLE WORKING, BANK HOLIDAY SWAP & BIRTHDAY DAY OFF You are entitled to an additional day off for your birthday, from your first day of employment.
  • GREAT LOCAL DISCOUNTS This includes membership discounts with Soho Friends, local restaurants and retailers in Westfield White City and Television Centre.

Full details of our benefits will be shared when you join us.

Publicis Groupe operates a hybrid working pattern with full-time employees being office-based three days during the working week.

We are supportive of all candidates and are committed to providing a fair assessment process. If you have any circumstances (such as neurodiversity, physical or mental impairments or a medical condition) that may affect your assessment, please inform your Talent Acquisition Partner. We will discuss possible adjustments to ensure fairness. Rest assured, disclosing this information will not impact your treatment in our process.

Please make sure you check out the Publicis Career Page which showcases our Inclusive Benefits and our EAGs (Employee Action Groups).


#J-18808-Ljbffr

Related Jobs

View all jobs

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Senior Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How Many Data Science Tools Do You Need to Know to Get a Data Science Job?

If you’re trying to break into data science — or progress your career — it can feel like you are drowning in names: Python, R, TensorFlow, PyTorch, SQL, Spark, AWS, Scikit-learn, Jupyter, Tableau, Power BI…the list just keeps going. With every job advert listing a different combination of tools, many applicants fall into a trap: they try to learn everything. The result? Long tool lists that sound impressive — but little depth to back them up. Here’s the straight-talk version most hiring managers won’t explicitly tell you: 👉 You don’t need to know every data science tool to get hired. 👉 You need to know the right ones — deeply — and know how to use them to solve real problems. Tools matter, but only in service of outcomes. So how many data science tools do you actually need to know to get a job? For most job seekers, the answer is not “27” — it’s more like 8–12, thoughtfully chosen and well understood. This guide explains what employers really value, which tools are core, which are role-specific, and how to focus your toolbox so your CV and interviews shine.

What Hiring Managers Look for First in Data Science Job Applications (UK Guide)

If you’re applying for data science roles in the UK, it’s crucial to understand what hiring managers focus on before they dive into your full CV. In competitive markets, recruiters and hiring managers often make their first decisions in the first 10–20 seconds of scanning an application — and in data science, there are specific signals they look for first. Data science isn’t just about coding or statistics — it’s about producing insights, shipping models, collaborating with teams, and solving real business problems. This guide helps you understand exactly what hiring managers look for first in data science applications — and how to structure your CV, portfolio and cover letter so you leap to the top of the shortlist.

The Skills Gap in Data Science Jobs: What Universities Aren’t Teaching

Data science has become one of the most visible and sought-after careers in the UK technology market. From financial services and retail to healthcare, media, government and sport, organisations increasingly rely on data scientists to extract insight, guide decisions and build predictive models. Universities have responded quickly. Degrees in data science, analytics and artificial intelligence have expanded rapidly, and many computer science courses now include data-focused pathways. And yet, despite the volume of graduates entering the market, employers across the UK consistently report the same problem: Many data science candidates are not job-ready. Vacancies remain open. Hiring processes drag on. Candidates with impressive academic backgrounds fail interviews or struggle once hired. The issue is not intelligence or effort. It is a persistent skills gap between university education and real-world data science roles. This article explores that gap in depth: what universities teach well, what they often miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in data science.