Engineer the Quantum RevolutionYour expertise can help us shape the future of quantum computing at Oxford Ionics.

View Open Roles

Staff Data Engineer, AI Evaluation

Wayve
London
1 month ago
Create job alert

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, veteran status, pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law.

About us

Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.

Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving.

In our fast-paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.

At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.

Make Wayve the experience that defines your career!

Wayve's machine learning-first approach relies on high-quality, well-structured data. The Evaluation Workflows and Measurement teams build tools and pipelines that power model evaluation at scale. As we scale our evaluation approaches and tooling, we need to process massive volumes of test data efficiently and reliably.

This Data Engineer will be embedded in the AI Evaluation division to ensure our evaluation and analytics pipelines are robust, performant, and future-proof. Their work will strengthen our data foundations for fast decision-making, accelerate the availability of large-scale image and video analytics, and help us rapidly integrate and leverage data from external partners - enabling faster iteration across both offline and on-road evaluation.

Challenges you will own

  • Build scalable and reliable data and analytics pipelines to process and enrich over 1 million hours of driving video data annually and supply mission-critical data to stakeholders across the business.
  • Unlock rapid insights by architecting and optimising analytics pipelines that drive company wide development and decision-making.
  • Collaborate across functions - including research engineers, simulation experts, robotics engineers, data scientists and safety drivers - to deliver and visualise enriched data.
  • Improve pipeline observability, validation, and fault tolerance for production-grade robustness.
  • Enable LLM-driven workflows by shaping data to be AI-consumable (e.g. chunking, embeddings, metadata).
  • Reduce tech debt and simplify orchestration across Flyte, Databricks, and Azure-based infrastructure.

Example Projects:

  • Design and optimise distributed data pipelines to handle large-scale video and image data processing.
  • Re-design and optimise existing analytics pipelines.
  • Collaborate with the data platform team to integrate pipelines with Databricks for governance and compliance - and unlock massive scale for offline evaluation from third party datasets.
  • Shape evaluation data to support future use cases like Retrieval-Augmented Generation (RAG) and natural language analytics.

What we are looking for in our candidate

  • Proficiency in Python and SQL, with experience in frameworks like Pandas, PySpark, and NumPy for large-scale data processing.
  • Expertise in debugging and optimising distributed systems with a focus on scalability and reliability.
  • Proven ability to design and implement scalable, fault-tolerant ETL pipelines with minimal manual intervention.
  • Knowledge of data modelling best practices, including the medallion architecture or comparable frameworks.
  • Experience in workflow orchestration using Flyte, dbt, Airflow, or Prefect.
  • Strong understanding of unit, integration, and data validation testing using tools like Pytest or Great Expectations.
  • Familiarity with cloud infrastructure (preferably Azure) for managing pipelines and storage
  • Ability to collaborate closely with stakeholders to understand requirements and shape data pipelines to meet user needs effectively.
  • 5+ years of experience in a data engineering or similar role
  • Experience with Docker, Kubernetes, Databricks
  • Familiarity with shaping data for AI/LLM-based systems

This is a full-time role based in our office in London. At Wayve we want the best of all worlds so we operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships and learning, and time spent working from home. We operate core working hours so you can determine the schedule that works best for you and your team.

#LI-FH1

We understand that everyone has a unique set of skills and experiences and that not everyone will meet all of the requirements listed above. If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.

DISCLAIMER: We will not ask about marriage or pregnancy, care responsibilities or disabilities in any of our job adverts or interviews. However, we do look to capture information about care responsibilities, and disabilities among other diversity information as part of an optional DEI Monitoring form to help us identify areas of improvement in our hiring process and ensure that the process is inclusive and non-discriminatory.

Apply for this job

*

indicates a required field

First Name *

Last Name *

Email *

Phone *

Location (City) *

Resume/CV *

Enter manually

Accepted file types: pdf, doc, docx, txt, rtf

Education

School Select...

Degree Select...

Select...

LinkedIn Profile *

When are you available to start? *

Do you require sponsorship? * Select...

What is your preferred pronoun?

Learn more about how we handle your data for recruiting purposes in our privacy notice * Select...


#J-18808-Ljbffr

Related Jobs

View all jobs

Staff Data Engineer, AI Evaluation London

Staff Data Scientist

Senior Data Scientist

Project Controls Senior Data Analyst

Senior Data Scientist

Data Scientist (Machine Learning Observability & Governance)

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Pre-Employment Checks for Data Science Jobs: DBS, References & Right-to-Work and more Explained

Pre-employment screening in data science reflects the discipline's unique position at the intersection of statistical analysis, machine learning innovation, and strategic business intelligence. Data scientists often have privileged access to comprehensive datasets, proprietary algorithms, and business-critical insights that form the foundation of organisational strategy and competitive positioning. The data science industry operates within complex regulatory frameworks spanning GDPR, sector-specific data protection requirements, and emerging AI governance regulations. Data scientists must demonstrate not only technical competence in statistical modelling and machine learning but also deep understanding of research ethics, data privacy principles, and the societal implications of algorithmic decision-making. Modern data science roles frequently involve analysing personally identifiable information, financial data, healthcare records, and sensitive business intelligence across multiple jurisdictions and regulatory frameworks simultaneously. The combination of analytical privilege, predictive capabilities, and strategic influence makes thorough candidate verification essential for maintaining compliance, security, and public trust in data-driven insights and automated systems.

Why Now Is the Perfect Time to Launch Your Career in Data Science: The UK's Analytics Revolution

The United Kingdom stands at the forefront of a data science revolution that's reshaping how businesses make decisions, governments craft policies, and society tackles its greatest challenges. From the machine learning algorithms powering London's fintech innovation to the predictive models guiding Manchester's smart city initiatives, Britain's transformation into a data-driven economy has created an unprecedented demand for skilled data scientists that far outstrips the available talent. If you've been contemplating a career transition or seeking to position yourself at the cutting edge of the digital economy, data science represents one of the most intellectually stimulating, financially rewarding, and socially impactful career paths available today. The convergence of big data maturation, artificial intelligence mainstream adoption, business intelligence evolution, and cross-industry digital transformation has created the perfect conditions for data science career success.

Automate Your Data Science Jobs Search: Using ChatGPT, RSS & Alerts to Save Hours Each Week

Data science roles land daily across banks, product companies, consultancies, scaleups & the public sector—often buried in ATS portals or duplicated across boards. The fix: put discovery on rails with keyword-rich alerts, RSS feeds & a reusable ChatGPT workflow that triages listings, ranks fit, & tailors your CV in minutes. This copy-paste playbook is for www.datascience-jobs.co.uk readers. It’s UK-centric, practical, & designed to save you hours each week. What You’ll Have Working In 30 Minutes A role & keyword map spanning Core DS, Applied/Research, Product/Decision Science, NLP/CV, Causal/Experimentation, Time Series/Forecasting, MLOps-adjacent & Analytics Engineering overlaps. Shareable Boolean searches for Google & job boards that strip out noise. Always-on alerts & RSS feeds that bring fresh UK roles to you. A ChatGPT “Data Science Job Scout” prompt that deduplicates, scores match & outputs ready-to-paste actions. A simple pipeline tracker so deadlines & follow-ups never slip.