Aboutincident.io
incident.iois the leading all-in-one platform for incident management. From small bugs to major outages,incident.iohelps teams respond fast, reduce downtime, and improve every time something goes wrong.
Since launching in 2021, we’ve helped 800 companies—including Netflix, Airbnb and Block—resolve over 250,000 incidents. Every month, more than 30,000 responders across Engineering, Product and Support useincident.ioto fix things faster.
We’re a small team that cares deeply about pragmatism, quality, magic, and pace. We are backed by leading investors like Index Ventures, and Point Nine, alongside many angel investors who are founders and executives of world-class companies.
What we’re building
When site outages, data breaches, or functionality issues strike,incident.iobrings together the right people, streamlines response efforts, and provides tools to fix issues quickly.
Over the last 9 months, we’ve built some of the most ambitious AI-native features in our product. We’re now working on Investigations—an AI agent that identifies problems, explains root causes, and eventually fixes issues for you. Were not just applying AI to our product; were reimagining incident response with AI, working with cutting-edge technologies to build something completely new.
What we think we can achieve keeps increasing, and we’re looking to grow the team with AI engineers to make it happen.
What AI Engineering looks like here
We’ve gone from sprinkling moments of magic with AI to building an investigative agent, a copilot and more. All powered by a world-class internal platform: one that helps us run and evaluate LLMs, orchestrate complex agents, and build safety checks around the edges. You’ll be joining this established team and continue to solve complex problems usingLLMs.
To power this, the team has embraced the inherent challenges of working with non-deterministic systems. We’ve built our own eval frameworks, orchestration engines, and debugging tools—because the ones on the market weren’t good enough. You’ll use these systems to improve the performance of our AI agents and help develop these frameworks further.
We prioritize pace and impact for customers, above all else. Youll be building AI that transforms the most stressful moments in engineering, for world-class companies like Netflix, Block, and Airbnb. Your work will directly impact how some of the worlds most innovative companies respond when things go wrong, saving countless hours of engineering time.
What you’ll do
Design, develop, and implement agentsthat can investigate incidents, provide insights, and suggest solutions.
Drive end-to-end development of AI infrastructure and applications:building real products people use, not just research prototypes.
Build AI evaluation frameworksand implement systems to measure and improve AI performance.
What you can bring to the team
We dont expect you to have experience with everything in our stack, but we do look for strong fundamentals, a passion for solving complex problems, and the ability to navigate ambiguity with confidence.
Excitement to build a state-of-the-art AI applicationwith the frontier models, tools, and techniques that will be used by the best AI labs & product companies in the world.
3+ years building production-ready applicationsin programming languages such as Go, Python, Ruby on Rails, or React—similar languages used in AI development.
Customer-focused and product-minded approachto building AI solutions that solve real user problems.
Comfortable with the unknownand adept at navigating the unique challenges of AI product development.
Our tech stack
Our primary development stack includes Go, TypeScript with React, and Postgres, with deployment on Google Cloud Platform using GKE and Cloud SQL.
Our stack for AI development is somewhat different than most. We do rely on products like Anthropic, Vertex, and OpenAI but weve also invested heavily in ourinternal AI tooling.
We developed our own library for building agents, evaluating performance, and backtesting changes - we call this the “Workbench,” which means were able to take an idea very quickly to production and then monitor how its performing in the wild.
The salary for this position is determined by several job-related factors, such as experience, relevant skills, training, location, business needs, or market demands. The salary range for this role is£110,000-£130,000, plus offers equity options.
Our commitment to diversity
We embrace diversity atincident.io, and believe in creating supportive and inclusive environments where all of our employees can succeed. To build a product that’s loved by everyone, we need a team with all kinds of different perspectives, experiences, and backgrounds. That’s why we’re committed to hiring people regardless of race, religion, colour, national origin, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity, age, neurodiversity status, disability status, or otherwise.
Got a question?
If you have any questions before applying to the role, please email the team at .
J-18808-Ljbffr