Search - Workchat - Applied Data Scientist II

Elastic
City of London
4 months ago
Applications closed

Related Jobs

View all jobs

Data Science Lead

Data Scientist

Data Engineer - Shared Platform Services - SC Contract

Data Engineer - Visualisation

Data Scientist / AI Engineer

Data Engineer - SC Contract

Search - Workchat - Applied Data Scientist II

Join to apply for the Search - Workchat - Applied Data Scientist II role at Elastic


Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale—unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data—securing and protecting private information more effectively—Elastic’s complete, cloud‑based solutions for search, security, and observability help organizations deliver on the promise of AI.


What Is The Role

The Search Conversational Experiences team builds Elastic’s new conversational (agentic) platform that lets customers chat with their own data in Elasticsearch. We own the quality layer for RAG, agents and tools, retrieval/citations, streaming, memory, and—crucially—the evaluation signals that turn open‑ended questions into grounded, reliable answers. As a Data Scientist, you’ll be part of a cross‑functional team (backend, DS, PM, UX) driving chat quality end‑to‑end: designing and running evaluation pipelines, improving prompts and tool behaviors, and turning measurements into product decisions that customers can feel.


You’ll help tackle frontier problems—folding RAG and vector search into an agent’s knowledge base, dynamically enriching model context to boost groundedness, shaping agent routing and tool selection policies, lighting up agent‑driven visualizations on top of Elasticsearch data, and exploring multimodality and reasoning strategies where they truly move the needle. This is an applied role: you will prototype, evaluate, and partner with engineers to ship.


What You Will Be Doing

  • Own well scoped pieces of the offline and online evaluation pipeline for agent workflows: retrieval coverage, reranking quality, reasoning traces, tool selection accuracy, citation integrity, and final answer helpfulness and faithfulness.
  • Calibrate and validate LLM‑as‑judge rubrics against human labels, track agreement with statistics, and add periodic checks to prevent drift.
  • Instrument agent runs with traces so you can localize errors to retrieval, reasoning, tool execution, or grounding, then contribute CI checks that block merges on regressions.
  • Translate evaluation readouts into product calls such as model choice, routing policy, tool gating thresholds, prompt and chunking updates, and agent customization for Elastic use cases.
  • Collaborate with backend engineers on contracts for ES|QL, citations, and telemetry schemas, and with PM and UX to land findings in shipped features.
  • Share outcomes through clear docs, notebooks, and PRs, and contribute utilities that make evaluation faster and more reproducible for the team.

What You Will Bring

  • 3 to 5 years in applied DS or ML with production ownership, including at least 1 to 2 years focused on evaluating LLM or agent workflows in shipped systems.
  • Proven experience designing and running stepwise evaluations for agent pipelines: retrieval coverage, reranking quality, reasoning traces, tool selection accuracy, citation grounding, and final answer helpfulness and faithfulness.
  • Golden set hygiene: stratified dataset design, leakage controls, reviewer guidelines, inter‑rater checks, and versioned labels.
  • Fluent with offline IR metrics and guardrails: Recall@k, nDCG, MRR, groundedness or citation support, plus latency and cost tracking; can move from offline gains to online A/B tests.
  • Telemetry and traces for agent runs that localize failures to retrieval, reasoning, tool execution, or grounding; ability to add CI quality gates that block merges on regressions.
  • Practical Elasticsearch experience or a similar search system; ES|QL familiarity is a plus.
  • Strong written communication and async collaboration habits in a distributed team.

Compensation

Base salary only, no variable component. Typical starting salary ranges:



  • General locations: $110,900—$175,500 USD
  • Seattle, LA, SF Bay Area, New York City metro area: $133,200—$210,700 USD

Benefits

  • Competitive pay based on the work you do here and not your previous salary.
  • Health coverage for you and your family in many locations.
  • Ability to craft your calendar with flexible locations and schedules for many roles.
  • Generous number of vacation days each year.
  • Match up to $2,000 for financial donations and service.
  • Up to 40 hours each year to use toward volunteer projects you love.
  • Embracing parenthood with minimum of 16 weeks of parental leave.
  • Company‑matched 401(k) with dollar‑for‑dollar matching up to 6% of eligible earnings.
  • Participation in Elastic’s stock program.

EEO Statement

Elastic is an equal opportunity employer and is committed to creating an inclusive culture that celebrates different perspectives, experiences, and backgrounds. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, pregnancy, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, disability status, or any other basis protected by federal, state or local law, ordinance or regulation. We welcome individuals with disabilities and strive to create an accessible and inclusive experience for all individuals.


Applicants have rights under Federal Employment Laws; view posters linked below: Family and Medical Leave Act (FMLA) Poster; Pay Transparency Nondiscrimination Provision Poster; Employee Polygraph Protection Act (EPPA) Poster; and Equal Employment Opportunity (EEO) Poster.


#J-18808-Ljbffr

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How Many Data Science Tools Do You Need to Know to Get a Data Science Job?

If you’re trying to break into data science — or progress your career — it can feel like you are drowning in names: Python, R, TensorFlow, PyTorch, SQL, Spark, AWS, Scikit-learn, Jupyter, Tableau, Power BI…the list just keeps going. With every job advert listing a different combination of tools, many applicants fall into a trap: they try to learn everything. The result? Long tool lists that sound impressive — but little depth to back them up. Here’s the straight-talk version most hiring managers won’t explicitly tell you: 👉 You don’t need to know every data science tool to get hired. 👉 You need to know the right ones — deeply — and know how to use them to solve real problems. Tools matter, but only in service of outcomes. So how many data science tools do you actually need to know to get a job? For most job seekers, the answer is not “27” — it’s more like 8–12, thoughtfully chosen and well understood. This guide explains what employers really value, which tools are core, which are role-specific, and how to focus your toolbox so your CV and interviews shine.

What Hiring Managers Look for First in Data Science Job Applications (UK Guide)

If you’re applying for data science roles in the UK, it’s crucial to understand what hiring managers focus on before they dive into your full CV. In competitive markets, recruiters and hiring managers often make their first decisions in the first 10–20 seconds of scanning an application — and in data science, there are specific signals they look for first. Data science isn’t just about coding or statistics — it’s about producing insights, shipping models, collaborating with teams, and solving real business problems. This guide helps you understand exactly what hiring managers look for first in data science applications — and how to structure your CV, portfolio and cover letter so you leap to the top of the shortlist.

The Skills Gap in Data Science Jobs: What Universities Aren’t Teaching

Data science has become one of the most visible and sought-after careers in the UK technology market. From financial services and retail to healthcare, media, government and sport, organisations increasingly rely on data scientists to extract insight, guide decisions and build predictive models. Universities have responded quickly. Degrees in data science, analytics and artificial intelligence have expanded rapidly, and many computer science courses now include data-focused pathways. And yet, despite the volume of graduates entering the market, employers across the UK consistently report the same problem: Many data science candidates are not job-ready. Vacancies remain open. Hiring processes drag on. Candidates with impressive academic backgrounds fail interviews or struggle once hired. The issue is not intelligence or effort. It is a persistent skills gap between university education and real-world data science roles. This article explores that gap in depth: what universities teach well, what they often miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in data science.