Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

ML Data Engineer

Recraft
City of London
6 days ago
Create job alert
About Us

Founded in the US in 2022 and now based in London, UK, Recraft is an AI tool for professional designers, illustrators, and marketers, setting a new standard for excellence in image generation.


We designed a tool that lets creators quickly generate and iterate original images, vector art, illustrations, icons, and 3D graphics with AI. Over 3 million users across 200 countries have produced hundreds of millions of images using Recraft, and we’re just getting started.


Join a universe of professional opportunities, develop and support large-scale projects, and shape the future of creativity. We are committed to making Recraft an essential, daily tool for every designer and setting the industry standard. Our mission is to ensure that creators can fully control their creative process with AI, providing them with innovative tools to turn ideas into reality.


If you’re passionate about pushing the boundaries of AI, we want you on board!


Job Description

At Recraft, we’re building the next generation of generative models across images and text. We’re looking for an ML Data Engineer to scale our data pipelines for unstructured data (primarily images) and keep our training flows fast, reliable, and repeatable. You’ll design and operate high-throughput ingestion and preprocessing on Kubernetes, evolve our internal data-pipeline framework, and work hand-in-hand with ML engineers to ship datasets that move model quality forward.


Key Responsibilities

  • Build robust crawlers/scrapers to collect large-scale image (and occasional text/HTML) datasets from diverse sources.


  • Own the end-to-end flow: raw data → quality/beauty/relevance filtering → dedup/validation → ready-to-train artifacts.
    Operate and improve our Kubernetes-based data-pipeline framework (distributed jobs, retries, monitoring, automation).


  • Work with S3-style object storage: efficient layouts, lifecycle, throughput, and cost awareness.


  • Add tooling around pipelines (progress/health visualization, metrics, alerts) for observability and faster iteration.


  • Collaborate closely with ML engineers to align datasets with training needs and accelerate experimentation.



Requirements

Must-have



  • Strong Python fundamentals; you write clean, maintainable, production-ready code.


  • Solid hands-on Kubernetes experience (containers, jobs, batch/distributed processing).


  • Proven track record with unstructured data, especially images (loading, filtering, transforming at scale).


  • Experience building web crawlers/parsers and handling real-world failure modes gracefully.


  • Comfort with S3/object storage and moving lots of data efficiently and safely.


  • Pragmatic, detail-oriented, ownership mindset; you enjoy making systems reliable and fast.



Nice-to-have



  • Familiarity with ML workflows (PyTorch) and downstream training considerations.


  • Experience with image quality scoring, captioning, or image-to-text pipelines.


  • DAG/workflow visualizations or pipeline UX tooling.


  • DevOps fluency: Docker, CI/CD, infra automation.



What We Offer

  • Competitive salary and equity.


  • We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates.


  • Real impact on model quality: your pipelines directly power training runs and product improvements.


  • Ownership with support: autonomy to design and improve systems, alongside experienced ML peers.


  • Modern stack: Python, Kubernetes, S3, internal pipeline framework built for scale.


  • Growth: a fast-moving environment where shipping well-engineered systems is the norm.



#J-18808-Ljbffr

Related Jobs

View all jobs

AI/ML Data Engineer: Production Models on Azure

GCP AI/ML Data Engineer — Marketing Automation (London)

Senior AI/ML Data Engineer - 100% Remote - EMEA

Google Cloud AI/ML Data Engineer

Data Engineer

Sr. AI Data Engineer (UK Remote)

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Data Science Recruitment Trends 2025 (UK): What Job Seekers Need To Know About Today’s Hiring Process

Summary: UK data science hiring has shifted from title‑led CV screens to capability‑driven assessments that emphasise rigorous problem framing, high‑quality analytics & modelling, experiment/causality, production awareness (MLOps), governance/ethics, and measurable product or commercial impact. This guide explains what’s changed, what to expect in interviews & how to prepare—especially for product/data scientists, applied ML scientists, decision scientists, econometricians, growth/marketing analysts, and ML‑adjacent data scientists supporting LLM/AI products. Who this is for: Product/decision/data scientists, applied ML scientists, econometrics & causal inference specialists, experimentation leads, analytics engineers crossing into DS, ML generalists with strong statistics, and data scientists collaborating with platform/MLOps teams in the UK.

Why Data Science Careers in the UK Are Becoming More Multidisciplinary

Data science once meant advanced statistics, machine learning models and coding in Python or R. In the UK today, it has become one of the most in-demand professions across sectors — from healthcare to finance, retail to government. But as the field matures, employers now expect more than technical modelling skills. Modern data science is multidisciplinary. It requires not just coding and algorithms, but also legal knowledge, ethical reasoning, psychological insight, linguistic clarity and human-centred design. Data scientists are expected to interpret, communicate and apply data responsibly, with awareness of law, human behaviour and accessibility. In this article, we’ll explore why data science careers in the UK are becoming more multidisciplinary, how these five disciplines intersect with data science, and what job-seekers & employers need to know to succeed in this transformed field.

Data Science Team Structures Explained: Who Does What in a Modern Data Science Department

Data science is one of the most in-demand, dynamic, and multidisciplinary areas in the UK tech and business landscape. Organisations from finance, retail, health, government, and beyond are using data to drive decisions, automate processes, personalise services, predict trends, detect fraud, and more. To do that well, companies don’t just need good data scientists; they need teams with clearly defined roles, responsibilities, workflows, collaboration, and governance. If you're aiming for a role in data science or recruiting for one, understanding the structure of a data science department—and who does what—can make all the difference. This article breaks down the key roles, how they interact across the lifecycle of a data science project, what skills and qualifications are typical in the UK, expected salary ranges, challenges, trends, and how to build or grow an effective team.