Research Engineer/Research Scientist – Model Transparency

AI Security Institute

London, United Kingdom

Last month

Job Type: Permanent
Work Pattern: Full-time
Work Location: On-site
Seniority: Mid
Education: Degree
Security Clearance: Required
Posted: 24 Apr 2026 (Last month)

Benefits

Security clearance

Save job

Create job alert

Job Type: Permanent
Work Pattern: Full-time
Work Location: On-site
Seniority: Mid
Education: Degree
Security Clearance: Required
Posted: 24 Apr 2026 (Last month)

Benefits

Security clearance

Save job

Create job alert

About the AI Security Institute

The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We’re in the heart of the UK government with direct lines to No. 10 (the Prime Minister's office), and we work with frontier developers and governments globally.

We’re here because governments are critical for advanced AI going well, and UK AISI is uniquely positioned to mobilise them. With our resources, unique agility and international influence, this is the best place to shape both AI development and government action.

The deadline for applying to this role is Sunday 24th May 2026, end of day, anywhere on Earth.

Team Description

The ability to effectively evaluate and monitor AI systems will grow in importance as models become more capable, autonomous, and integrated into society. If models can detect and game evaluations, obscure their reasoning, or behave differently under observation, the safety claims that governments and developers rely on become unreliable. Understanding and addressing these risks is essential to ensuring that oversight of advanced AI systems keeps pace with their capabilities.

The Model Transparency team is a research team within AISI focused on ensuring that evaluations, assessments, and monitoring of frontier AI systems remain reliable as models become less transparent. We research how and why oversight is declining – through phenomena such as evaluation awareness, unfaithful chain-of-thought reasoning, and changes in model architectures – and develop methods (including white and black box methods) to detect, measure, and mitigate potential issues. We share our findings with frontier AI companies (including Anthropic, OpenAI, DeepMind), UK government officials, and allied governments, and publicly to inform their deployment, research, and policy decisions. We also work directly with safety teams at frontier labs, contributing to safety case reviews and helping improve their alignment evaluation methodology.

Our recent work includes auditing games for sandbagging, reproducing natural emergent misalignment from reward hacking, and identifying open-weight language models that game propensity evaluations.

Role description

We're looking for Research Scientists and Research Engineers for the Model Transparency team with expertise in technical AI safety – such as interpretability, capability or alignment evaluations, model transparency – or with broader experience with frontier LLM research and development. An ideal candidate would have a strong track record of high-quality research in technical AI safety or adjacent fields.

Research Scientists, drive the technical substance of our work – staying abreast of the literature, proposing and designing experiments, conducting rigorous analyses, and owning the evidence stack from experiment through to written output. They write, critique, and strengthen the team's reports and publications.
Research Engineers, build the systems and tooling that make our research possible and fast – scaling experimental workflows, automating processes, solving infrastructure challenges, and creating systems that accelerate the entire team's output.

We're interested in candidates along the spectrum between Research Engineers and Research Scientists. The application form will ask you to indicate which role you lean towards.

The team is led by Joseph Bloom, advised by Geoffrey Irving. You'll work with talented, mission-driven technical staff across AISI, including alumni from Anthropic, OpenAI, DeepMind, and top universities. You may also collaborate with external research teams including those at frontier AI labs, METR, and FAR.

We are open to hires across a range of experience levels.

Representative Projects You Might Work On

Developing a chain-of-thought monitorability benchmark and comparing monitorability properties across frontier AI systems, leveraging AISI’s unique access to reasoning traces from multiple labs.
Designing and running experiments on open-weight models to study alignment and oversight-relevant phenomena – such as reproducing emergent misalignment from reward hacking, or red-teaming techniques like inoculation prompting and character training.

Using white-box and interpretability methods – such as activation oracles, sparse auto-encoders or probes – to detect misalignment that isn’t visible through behavioural evaluation alone.
Building tooling and infrastructure for our research – including agent orchestration, large-scale RL pipelines, mechanistic interpretability methodologies, and auditing agents.

The work could also involve:

Reviewing frontier lab risk assessments and safety cases, providing independent analysis of alignment claims before deployment decisions.
Conducting literature reviews and expert interviews to map the state of model transparency risks and inform AISI’s strategic priorities.
Translating technical findings into actionable insights for AISI evaluation teams, UK government officials, and international partners.

What we’re looking for

If you’re unsure whether you meet the criteria below, we’d encourage you to apply anyway – we’d rather you erred on the side of applying than not.

Requirements for both roles:

Related Jobs

View all jobs

Software Engineer (Data Services), London

Isomorphic Labs London, United Kingdom

On-site

Research Engineer - Societal Impacts

AI Security Institute London, United Kingdom

On-site Clearance Required

Research Engineer - Societal Impacts

AI Security Institute London, United Kingdom

On-site Clearance Required

ML Research Engineer, London

Isomorphic Labs London, United Kingdom

On-site

Senior RF Data Scientist / Research Engineer

Adria Solutions Cambridge, United Kingdom

£80,000 – £110,000 pa Hybrid

Research Scientist (Applied LLMs), London

Isomorphic Labs London, United Kingdom

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

May 26, 2026

Jobs

Quant Researcher Jobs in London 2026: Hedge Fund Pay for Data Scientists

Quant researcher jobs in London 2026: total comp £150k–£2M+, top hedge funds and prop firms hiring, and how the role differs from data science.

Apr 9, 2026

Products

Where to Advertise Data Science Jobs in the UK (2026 Guide)

Where to advertise data science jobs UK in 2026: the specialist boards, communities and channels that actually reach senior and lead data science talent. Data science spans a broad and often misunderstood spectrum — from statistical modelling and experimental design through to machine learning engineering, product analytics and AI research. The strongest candidates identify firmly with specific subdisciplines and are frustrated by adverts that conflate data scientist with data analyst, business intelligence developer or machine learning engineer. General job boards produce high application volumes for data roles but consistently fail to match specialist data science profiles with the right opportunities. This guide, published by DataScienceJobs.co.uk, covers where to advertise data science roles in the UK in 2026, how the main platforms compare, what employers should expect to pay, and what the data says about hiring across different role types.

Apr 5, 2026

Jobs

Data Science Jobs UK 2026: What to Expect Over the Next 3 Years

Data Science Jobs UK 2026: roles, salaries and the trends shaping UK data science hiring over the next three years — from MLE crossover to GenAI workflows. Data science has spent the past decade being described as the sexiest job of the twenty-first century. By 2026, the reality is both more nuanced and more interesting than that label ever suggested. The discipline has matured, fragmented, deepened, and in some respects reinvented itself — and the jobs market has changed with it in ways that create genuine opportunity for those who understand what employers actually want, and genuine difficulty for those still operating on assumptions formed five years ago. The data science jobs market of 2026 is not simply a larger version of what it was three years ago. The generalist data scientist — equally comfortable wrangling data, building models, and presenting insights to the board — is giving way to a more specialised landscape where employers know exactly what problem they are trying to solve and are looking for candidates with the specific depth to solve it. Machine learning engineering, causal inference, experimentation, AI product development, and domain-specific applied science have all emerged as distinct career tracks within what was previously a single, loosely defined profession. At the same time, the arrival of large language models and the broader AI capability wave has both threatened and created data science roles in equal measure. Some of the work that junior data scientists spent their early careers doing — data cleaning, exploratory analysis, basic model building — is being partially automated by AI tooling. But the demand for practitioners who can evaluate AI systems rigorously, apply statistical thinking to complex business problems, and build the data foundations on which AI depends has grown considerably. The candidates who will thrive over the next three years are those who understand where the discipline is heading — which specialisms are attracting the most investment, which technologies are reshaping what data scientists are expected to build and know, and how to position a data science career that will remain valuable as the field continues to evolve around them. This article breaks down what the UK data science jobs market is likely to look like through to 2028 — covering the titles emerging right now, the technologies driving employer demand, the skills that will matter most, and how to position your career ahead of the curve.