Data Engineer

ConnexAI
Manchester
1 week ago
Build the Future of Conversational AI with ConnexAI

As a Speech Data Engineer, your work will power the data behind real-time speech systems used by millions worldwide, ensuring our AI learns from clean, accurate, and reliable datasets. By curating, analysing, and engineering the voice data that fuels our models, you’ll help shape products that transform how people and businesses communicate.

You’ll be part of the team that manages and scales our massive speech corpora, builds automated pipelines for cleaning and validation, and works closely with annotation and Machine Learning teams to keep our models at the cutting edge.

Core Responsibilities
  • Organise and maintain large-scale audio and text corpora, ensuring they are versioned correctly, catalogued, and easy to retrieve.
  • Build automated pipelines using Python, AWS, and Docker to clean, validate, and standardise speech data, detecting duplicates, transcription inconsistencies, or quality issues.
  • Develop and integrate APIs to streamline ingestion and processing of new datasets.
  • Analyse speech datasets to support ASR and TTS model development, performance evaluation, and linguistic research.
  • Implement and manage data version control tools to ensure dataset reproducibility and traceability.
  • Contribute to evaluation frameworks for ASR and TTS performance by analysing metrics such as Word Error Rate (WER), Speaker Similarity (SSim), and Mean Opinion Score (MOS) to generate data-driven insights.
  • Document data processes and tools, ensuring all datasets and analyses are well-documented, reproducible, and compliant with internal standards.
  • Collaborate closely with data scientists, ML engineers, and product teams to identify opportunities to improve data quality, balance, and diversity through targeted analysis and feedback loops.
Key Skills & Experience
  • Strong programming skills in Python for data processing, analysis, and automation.
  • Proficiency with SQL for developing and managing large-scale datasets.
  • Experience with AWS cloud services.
  • Hands‑on experience with Docker and containerised development environments.
  • Familiarity with data versioning tools (e.g., LakeFS, DVC) and dataset reproducibility principles.
  • Strong collaboration and communication skills.
  • Background in speech, audio, or NLP data processing is highly desirable.
Interview Process
  • 30‑minute video call with the team lead
  • Take‑home technical exercise
  • 90‑minute face‑to‑face interview
About ConnexAI

ConnexAI is an award‑winning Conversational AI platform designed by an elite engineering team. ConnexAI’s technology enables organisations to maximise profitability, increase revenue, and take productivity to new levels. ConnexAI provides cutting‑edge, enterprise‑grade AI applications, including AI Agent, AI Guru, AI Analytics, ASR, AI Voice, and AI Quality. We value growth for both our products and our people. As we scale, there will be clear opportunities to progress into senior data science, leadership, or principal research roles. Our high retention rate reflects our inclusive, supportive, and empowering environment.

Seniority level
  • Associate
Employment type
  • Full‑time
Job function
  • Information Technology and Research
Industries
  • Software Development, Data Infrastructure and Analytics, and Research Services


