Data Scientist (ML, Speech, NLP & Multimodal Expertise) | Manchester

Transperfect Gaming Solutions

Manchester

1 week ago

Create job alert

Overview

We are looking to hire a Data Scientist with strong expertise in machine learning, speech and language processing, and multimodal systems. This role is essential to driving our product roadmap forward, particularly in building out our core machine learning systems and developing next-generation speech technologies. The ideal candidate will be capable of working independently while effectively collaborating with cross-functional teams, and will be curious, experimental, and communicative.

Key Responsibilities

Create maintainable, elegant code and high-quality data products that are modeled, well-documented, and simple to use.
Build, maintain, and improve the infrastructure to extract, transform, and load data from a variety of sources using SQL, Azure, GCP and AWS technologies.
Perform statistical analysis of training datasets to identify biases, quality issues, and coverage gaps.
Implement automated evaluation pipelines that scale across multiple models and tasks.
Create interactive dashboards and visualization tools for model performance analysis.

Additional Responsibilities

Design and implement robust data ingestion pipelines for massive-scale text and speech corpora including automated data preprocessing and cleaning pipelines.
Create data validation frameworks and monitoring systems for dataset quality.
Develop sampling strategies for balanced and representative training data.
Implement comprehensive experiment tracking and hyperparameter optimization frameworks.
Conduct statistical analysis of training dynamics and convergence patterns.
Design A/B testing frameworks for comparing different training approaches.
Create automated model selection pipelines based on multiple evaluation criteria.
Develop cost-benefit analyses for different training configurations.
Design comprehensive benchmark suites with statistical significance testing.
Develop fairness metrics and bias detection systems.
Build real-time monitoring systems for model performance in production.
Implement feature drift detection and data quality monitoring.
Design feedback loops to capture user interactions and model effectiveness.
Create automated retraining pipelines based on performance degradation signals.
Develop business metrics and ROI analysis for model deployments.

Required Skills, Experience and QualificationsProgramming & Software Engineering

Python (Expert Level): Advanced proficiency in scientific computing stack (NumPy, Pandas, SciPy, Scikit-learn).
Version Control: Git workflows, collaborative development, and code review processes.
Software Engineering Practices: Testing frameworks, CI/CD pipelines, and production-quality code development.

Machine Learning and Language Model Expertise

Traditional Machine Learning and Deep Learning Knowledge: Proficiency in classical ML algorithms (Naive Bayes, SVM, Random Forest, etc.) and Deep Learning architectures.
Understanding of Transformer Architecture: Attention mechanisms, positional encoding, and scaling laws.
Training Pipeline Knowledge: Data preprocessing for large corpora, tokenization strategies, and distributed training concepts.
Evaluation Frameworks: Experience with standard NLP benchmarks (GLUE, SuperGLUE, etc.) and custom evaluation design.
Fine-tuning Techniques: Understanding of PEFT methods, instruction tuning, and alignment techniques.
Model Deployment: Knowledge of model optimization, quantization, and serving infrastructure for large models.

Collaboration & Adaptability

Strong communication skills are a must
Self-reliant but knows when to ask for help
Comfortable working in an environment where conventional development practices may not always apply
PBIs (Product Backlog Items) may not be highly detailed
Experimentation will be necessary
Ability to identify what's important in completing a task or partial task and explain/justify their approach
Can effectively communicate ideas and strategies
Proactive and takes initiative rather than waiting for PBIs to be assigned when circumstances call for it
Strong interest in AI and its possibilities, a genuine passion for certain areas can provide that extra spark
Curious and open to experimenting with technologies or languages outside their comfort zone

Mindset & Work Approach

Takes ownership when things don't go as planned
Capable of working from high-level explanations and general guidance on implementations and final outcomes
Continuous, clear communication is crucial, detailed step-by-step instructions may not always be available
Self-starter, self-motivated, and proactive in problem-solving
Enjoys exploring and testing different approaches, even in unfamiliar programming languages

Additional Skills, Experience and QualificationsMachine Learning & Deep Learning

Framework Proficiency: Scikit-learn, XGBoost, PyTorch (preferred) or TensorFlow for model implementation and experimentation.
MLOps Expertise: Model versioning, experiment tracking, model monitoring (MLflow, Weights & Biases), data monitoring and validation (Great Expectations, Prometheus, Grafana), and automated ML pipelines (GitHub CI/CD, Jenkins, CircleCI, GitLab etc.).
Statistical Modeling: Hypothesis testing, experimental design, causal inference, and Bayesian statistics.
Model Evaluation: Cross-validation strategies, bias-variance analysis, and performance metric design.
Feature Engineering: Advanced techniques for text, time-series, and multimodal data.

Data Engineering & Infrastructure

Big Data Technologies: Spark (PySpark), Hadoop ecosystem, and distributed computing frameworks (DDP, TP, FSDP).
Cloud Platforms: AWS (SageMaker, S3, EMR), GCP (Vertex AI, BigQuery), or Azure ML.
Database Systems: NoSQL databases (MongoDB, Elasticsearch), graph databases (Neo4j), and vector databases (Pinecone, Milvus, ChromaDB, FAISS etc.).
Data Pipeline Tools: Airflow, Prefect, or similar orchestration frameworks.
Containerization: Docker, Kubernetes for scalable model deployment

#J-18808-Ljbffr

Related Jobs

View all jobs

Data Scientist

Data Scientist - Palantir

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Sep 24, 2025

Jobs

Why Data Science Careers in the UK Are Becoming More Multidisciplinary

Data science once meant advanced statistics, machine learning models and coding in Python or R. In the UK today, it has become one of the most in-demand professions across sectors — from healthcare to finance, retail to government. But as the field matures, employers now expect more than technical modelling skills. Modern data science is multidisciplinary. It requires not just coding and algorithms, but also legal knowledge, ethical reasoning, psychological insight, linguistic clarity and human-centred design. Data scientists are expected to interpret, communicate and apply data responsibly, with awareness of law, human behaviour and accessibility. In this article, we’ll explore why data science careers in the UK are becoming more multidisciplinary, how these five disciplines intersect with data science, and what job-seekers & employers need to know to succeed in this transformed field.

Sep 20, 2025

Jobs

Data Science Team Structures Explained: Who Does What in a Modern Data Science Department

Data science is one of the most in-demand, dynamic, and multidisciplinary areas in the UK tech and business landscape. Organisations from finance, retail, health, government, and beyond are using data to drive decisions, automate processes, personalise services, predict trends, detect fraud, and more. To do that well, companies don’t just need good data scientists; they need teams with clearly defined roles, responsibilities, workflows, collaboration, and governance. If you're aiming for a role in data science or recruiting for one, understanding the structure of a data science department—and who does what—can make all the difference. This article breaks down the key roles, how they interact across the lifecycle of a data science project, what skills and qualifications are typical in the UK, expected salary ranges, challenges, trends, and how to build or grow an effective team.

Sep 17, 2025

Jobs

Why the UK Could Be the World’s Next Data Science Jobs Hub

Data science is arguably the most transformative technological field of the 21st century. From powering artificial intelligence algorithms to enabling complex business decisions, data science is essential across sectors. As organisations leverage data more rapidly—from retailers predicting customer behaviour to health providers diagnosing conditions—demand for proficiency in data science continues to surge. The United Kingdom is particularly well-positioned to become a global data science jobs hub. With world-class universities, a strong tech sector, growing AI infrastructure, and supportive policy environments, the UK is poised for growth. This article delves into why the UK could emerge as a leading destination for data science careers, explores the job market’s current state, outlines future opportunities, highlights challenges, and charts what must happen to realise this vision.

Data Scientist (ML, Speech, NLP & Multimodal Expertise) | Manchester

Related Jobs

Data Scientist

Data Scientist

Data Scientist

Data Scientist

Data Scientist

Data Scientist - Palantir

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

Industry Insights

Why Data Science Careers in the UK Are Becoming More Multidisciplinary

Data Science Team Structures Explained: Who Does What in a Modern Data Science Department

Why the UK Could Be the World’s Next Data Science Jobs Hub

Find the perfect job? Subscribe to job alerts to stay informed about new opportunities.