Sr. AI Data Engineer (UK Remote)

Turnitin
Newcastle upon Tyne
3 weeks ago
Applications closed
Overview

Turnitin, an AI-focused leader in the educational and research sectors, has been at the forefront of promoting academic integrity and innovation for over two decades. We are renowned for our cutting-edge solutions which have been embraced by thousands of academic institutions, corporations, and publishers worldwide.

We offer remote work as a standard arrangement and value diversity, respecting local cultures and individual choices. We are a remote-first employer with an office in Newcastle (UK) and colleagues across the globe. Our team is diverse yet unified by a shared commitment to making a significant impact in education.

AI and data science are integral to our success and ambitious product roadmap. As a Senior AI Data Engineer, you will join a global team of proactive, supportive, and independent professionals delivering sophisticated, well-structured AI and data systems. You’ll help pioneer our next-generation data and AI pipelines to scale our impact and collaborate with different teams to integrate AI and data science across a broad suite of products that enhance learning, teaching, and academic integrity.

Responsibilities
  • AI Data Infrastructure & Pipeline Management for Applied AI: Design, build, and operate scalable real-time data pipelines that support ongoing Applied AI model training. Deploy and maintain robust data infrastructure using AI techniques and engineering best practices to ensure continuous model improvement and deployment cycles.
  • Data Collection: Execute initiatives for collecting, normalizing, and storing data across multiple sources, including external LLM providers.
  • Collaboration: Partner with AI R&D, Applied AI, and Data Platform teams to ensure seamless data flow and quality standards. Partner with stakeholders to collect, curate, and catalog high-quality datasets that directly support Applied AI retraining workflows and business objectives.
  • AI R&D Support: Provide secondary support to AI Research & Development efforts by applying advanced data warehousing and engineering technologies. Contribute to exploratory data initiatives that uncover insights from Turnitin's extensive data resources.
  • Communication: Maintain clear communication channels across teams, ensuring alignment with company vision while sharing insights on data infrastructure needs and potential innovations.
  • Technology Evolution: Stay current with emerging tools and methodologies in AI data engineering, bringing recommendations to enhance our AI data infrastructure and capabilities.
Qualifications

Required Qualifications:

  • At least 4 years of experience in data engineering, ideally focused on AI/ML data infrastructure or enabling and accelerating AI R&D.
  • Strong proficiency in Python, SQL, and Infrastructure as Code (Terraform, CloudFormation), with additional experience in modern orchestration frameworks (Airflow, Prefect, or dbt).
  • Proficiency with cloud-native data platforms (AWS, Azure, GCP) and vector databases (Pinecone, Weaviate, Qdrant, or Chroma).
  • Experience with MLOps tools and platforms (HuggingFace, SageMaker Bedrock, Vertex AI), experiment tracking (MLflow, Weights & Biases), and model deployment pipelines.
  • Experience with Large Language Models (LLMs), embedding generation, retrieval-augmented generation (RAG) systems, and frameworks for orchestrating LLM interaction (LiteLLM, LangFuse, LangChain, LlamaIndex).
  • Strong problem-solving, analytical, and communication skills, with the ability to design scalable AI data systems and collaborate effectively in cross-functional teams.

Desired Qualifications:

  • 6+ years of experience in data engineering with a focus on AI and machine learning projects.
  • Experience in a technical leadership or mentorship role.
  • Experience in education, EdTech, or academic integrity sectors.
  • Experience using AI coding tools (Cursor, Claude Code, GitHub Copilot) for accelerated development.
  • Familiarity with natural language processing, computer vision, or multimodal AI applications.
  • Experience with data visualization (Streamlit) and data reporting.
Characteristics for Success

As a Senior AI Data Engineer, you should possess:

  • A passion for creatively solving complex data problems.
  • The ability to work collaboratively and cross-functionally.
  • A continuous learning mindset, always striving to improve your skills and knowledge.
  • A proven track record of delivering results and ensuring a high level of quality.
  • Strong written and verbal communication skills.
  • Curiosity about the problems at hand, the field at large, and the best solutions.
  • Strong system-level problem-solving skills
Additional Information

Total Rewards @ Turnitin
Turnitin maintains a Total Rewards package that is competitive within the local job market. Beyond regular pay and bonuses, Turnitin provides generous time off and health and wellness programs that offer choice and flexibility, supporting your well-being. We maintain a remote-centric culture that enables purposeful and accountable work across diverse roles.

Our Mission is to ensure the integrity of global education and meaningfully improve learning outcomes.

Our Values underpin everything we do.

  • Customer Centric - We realize our mission by putting educators and learners at the center of everything we do.
  • Passion for Learning - We seek teammates who are constantly learning and growing and enable that growth.
  • Integrity - Integrity is the heartbeat of Turnitin, shaping our products, interactions, and partnerships.
  • Action & Ownership - We have a bias toward action and empower teammates to make decisions.
  • One Team - We break down silos, collaborate effectively, and celebrate each other’s successes.
  • Global Mindset - We respect local cultures and embrace diversity, thinking globally and acting locally to maximize impact.
Global Benefits
  • Remote First Culture
  • Health Care Coverage*
  • Education Reimbursement*
  • Competitive Paid Time Off
  • 4 Self-Care Days per year
  • National Holidays*
  • 2 Founder Days + Juneteenth Observed
  • Paid Volunteer Time*
  • Charitable contribution match*
  • Monthly Wellness or Home Office Reimbursement/*
  • Access to Modern Health (mental health platform)
  • Parental Leave*
  • Retirement Plan with match/contribution*

* varies by country

Seeing Beyond the Job Ad

At Turnitin, we recognize it’s unrealistic for candidates to fulfill 100% of the criteria in a job ad. We encourage you to apply if you meet the majority of the requirements because we know that skills evolve over time. If you’re willing to learn and evolve alongside us, join our team!

Turnitin, LLC is committed to the policy that all persons have equal access to its programs, facilities and employment. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.


#J-18808-Ljbffr

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How Many Data Science Tools Do You Need to Know to Get a Data Science Job?

If you’re trying to break into data science — or progress your career — it can feel like you are drowning in names: Python, R, TensorFlow, PyTorch, SQL, Spark, AWS, Scikit-learn, Jupyter, Tableau, Power BI…the list just keeps going. With every job advert listing a different combination of tools, many applicants fall into a trap: they try to learn everything. The result? Long tool lists that sound impressive — but little depth to back them up. Here’s the straight-talk version most hiring managers won’t explicitly tell you: 👉 You don’t need to know every data science tool to get hired. 👉 You need to know the right ones — deeply — and know how to use them to solve real problems. Tools matter, but only in service of outcomes. So how many data science tools do you actually need to know to get a job? For most job seekers, the answer is not “27” — it’s more like 8–12, thoughtfully chosen and well understood. This guide explains what employers really value, which tools are core, which are role-specific, and how to focus your toolbox so your CV and interviews shine.

What Hiring Managers Look for First in Data Science Job Applications (UK Guide)

If you’re applying for data science roles in the UK, it’s crucial to understand what hiring managers focus on before they dive into your full CV. In competitive markets, recruiters and hiring managers often make their first decisions in the first 10–20 seconds of scanning an application — and in data science, there are specific signals they look for first. Data science isn’t just about coding or statistics — it’s about producing insights, shipping models, collaborating with teams, and solving real business problems. This guide helps you understand exactly what hiring managers look for first in data science applications — and how to structure your CV, portfolio and cover letter so you leap to the top of the shortlist.

The Skills Gap in Data Science Jobs: What Universities Aren’t Teaching

Data science has become one of the most visible and sought-after careers in the UK technology market. From financial services and retail to healthcare, media, government and sport, organisations increasingly rely on data scientists to extract insight, guide decisions and build predictive models. Universities have responded quickly. Degrees in data science, analytics and artificial intelligence have expanded rapidly, and many computer science courses now include data-focused pathways. And yet, despite the volume of graduates entering the market, employers across the UK consistently report the same problem: Many data science candidates are not job-ready. Vacancies remain open. Hiring processes drag on. Candidates with impressive academic backgrounds fail interviews or struggle once hired. The issue is not intelligence or effort. It is a persistent skills gap between university education and real-world data science roles. This article explores that gap in depth: what universities teach well, what they often miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in data science.