Data Scientist II, RufusX Science UK

Amazon
London, United Kingdom
Last month
Job Type
Permanent
Posted
3 Mar 2026 (Last month)
We are looking for a passionate, talented, and inventive Data Scientist with a strong machine learning and analytics background to help build industry-leading language technology powering Rufus, our AI-driven search and shopping assistant, helping customers with their shopping tasks at every step of their shopping journey.

This innovative role focuses on developing and optimizing large language model (LLM)-powered conversational experiences. The core emphasis is to get the best performance out of state-of-the-art LLMs via careful and methodical instruction design, contextual grounding, informed choices of MCP tools and agent/multi-agent systems, evaluation frameworks, and experimentation to systematically improve LLM quality, robustness, and customer impact. The work combines scientific rigor with product intuition to systematically raise the bar for conversational AI performance at Amazon scale.

Our mission in conversational shopping is to make it easy for customers to find and discover the best products to meet their needs by helping with their product research, providing comparisons and recommendations, answering product questions, enabling shopping directly from images or videos, providing visual inspiration, and more. We do this by leveraging advanced analytics, Natural Language Processing (NLP), Machine Learning (ML), A/B testing, causal inference, and data-driven insights to continuously improve our systems.

Key job responsibilities
As a Data Scientist on our team, you will develop and maintain LLM instructions iterations and evaluation frameworks, including automated eval pipelines, LLM-as-a-judge methodologies, rubric design, and dataset curation to measure nuanced aspects of response quality. You will partner with the wider org to experiment with techniques such as retrieval augmentation, context enrichment, prompt decomposition, and model fine-tuning or post-training strategies, if and when applicable. You will leverage petabytes of data and identify opportunities to leverage machine learning models aimed at making conversational systems more performant.

A day in the life
You will:
Perform hands-on analysis of large-scale multimodal interaction datasets to develop insights into how customers engage with conversational AI systems and how to improve response quality and customer experience. Use statistical methods, experimentation, and data-driven analysis to develop scalable approaches for measuring, evaluating, and optimizing large language model (LLM)-based shopping assistant systems, leveraging structured and unstructured contextual signals. Design and analyze A/B tests and experiments to evaluate new features and model improvements, ensuring statistical rigor and actionable insights. Develop metrics, dashboards, and reporting frameworks to monitor system performance, customer engagement, and business impact. Conduct deep-dive analyses to identify opportunities for improving conversational relevance, grounding, customer satisfaction, and downstream business impact. Collaborate with Applied Scientists and Engineers to translate analytical insights into production systems, working closely on model evaluation and deployment. Establish automated processes for large-scale data analysis, ETL pipelines, metric generation, and experimentation frameworks. Communicate results and insights to both technical and non-technical audiences, including through presentations, written reports, and data visualizations.

About the team
The Rufus Features Science team, based in London, works alongside ~150 engineers, designers and product managers, shaping the future of AI-driven shopping experiences at Amazon. The team works on every aspect of the Rufus AI, from making Rufus agentic, enabling customers to set price alerts or empower Rufus to act on their behalf and automatically purchase products when the price is right, to understanding multimodal user queries and generating answers that combine text, image, audio and video, including deep research reports that scour the web and the Amazon catalog to provide detailed and personalised shopping guidance. We utilize and advance state-of-art techniques in the fields of Natural Language Processing, gen AI, Information Retrieval, Machine/Deep Learning, and Data Mining. We validate our work by actively participating in the internal and external scientific communities.

Related Jobs

View all jobs

Data Scientist

Randstad Technologies Recruitment London, United Kingdom

Data Scientist

Hays Technology London, United Kingdom
£600 – £1,000 pd

Data Scientist

Faculty London, United Kingdom
Hybrid

Data Scientist

Data Idols Farringdon, Greater London, London, EC1M 4BJ, United Kingdom
£85,000 – £95,000 pa

Data Scientist

Vallum Associates London, United Kingdom

Data Scientist

Access Computer Consulting City of London, United Kingdom

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Where to Advertise Data Science Jobs in the UK (2026 Guide)

Advertising data science jobs in the UK requires a different approach to most technical hiring. Data science spans a broad and often misunderstood spectrum — from statistical modelling and experimental design through to machine learning engineering, product analytics and AI research. The strongest candidates identify firmly with specific subdisciplines and are frustrated by adverts that conflate data scientist with data analyst, business intelligence developer or machine learning engineer. General job boards produce high application volumes for data roles but consistently fail to match specialist data science profiles with the right opportunities. This guide, published by DataScienceJobs.co.uk, covers where to advertise data science roles in the UK in 2026, how the main platforms compare, what employers should expect to pay, and what the data says about hiring across different role types.

New Data Science Employers to Watch in 2026: UK and International Companies Leading Analytics and AI Innovation

Data science has emerged as one of the most transformative forces across industries, turning raw information into actionable insights, predictive models, and AI-powered solutions. In 2026, the UK is witnessing a surge in organisations where data science is not just a support function but the core of their products and services. For professionals exploring opportunities on www.DataScience-Jobs.co.uk , identifying these employers early can provide a competitive advantage in a market with high demand for advanced analytics and machine learning expertise. This article highlights new and high-growth data science employers to watch in 2026, focusing on UK startups, scale-ups, and global firms expanding their data science operations locally. All of the companies included have recently raised investment, won high-profile contracts, or significantly scaled their analytics teams.

How Many Data Science Tools Do You Need to Know to Get a Data Science Job?

If you’re trying to break into data science — or progress your career — it can feel like you are drowning in names: Python, R, TensorFlow, PyTorch, SQL, Spark, AWS, Scikit-learn, Jupyter, Tableau, Power BI…the list just keeps going. With every job advert listing a different combination of tools, many applicants fall into a trap: they try to learn everything. The result? Long tool lists that sound impressive — but little depth to back them up. Here’s the straight-talk version most hiring managers won’t explicitly tell you: 👉 You don’t need to know every data science tool to get hired. 👉 You need to know the right ones — deeply — and know how to use them to solve real problems. Tools matter, but only in service of outcomes. So how many data science tools do you actually need to know to get a job? For most job seekers, the answer is not “27” — it’s more like 8–12, thoughtfully chosen and well understood. This guide explains what employers really value, which tools are core, which are role-specific, and how to focus your toolbox so your CV and interviews shine.