

The Triage Shot Clock: When to Ask for Help During An Incident
A practical approach to setting time limits and escalating with intent.

Incidents are resolved by highly technical folks who have a deep understanding of the systems that failed. But depending on their impact, incidents may need to be understood by many more people than those who helped resolve them, from stakeholders to engineers from different teams.
However, pointing them to a lengthy postmortem document that references a codebase they’re not familiar with makes incidents difficult to grasp. That means responders are often dragged into meetings or are repeatedly asked questions about what happened.
What if responders could reduce the knowledge gap about the incident by providing a visualization of how the system was impacted? Better yet, what if responders could get these diagrams generated by an AI?
That’s the vision we worked towards on this Rootly AI Labs project, IncidentDiagram. IncidentDiagram is an open-source tool that uses several LLMs to generate incident diagrams from your postmortem document and codebase.
IncidentDiagram is a command-line tool designed to parse your incident retrospectives and your codebase to produce a visual representation of the incident's key events in a diagram.
IncidentDiagram is written in Python and relies on a series of specialized LLMs from OpenAI, Anthropic, and Gemini, all orchestrated through smolagents, an LLM framework from Hugging Face.
We put together a demo and a walk through of IncidentDiagram works on a short video:
In a nutshell, IncidentDiagram has three main stages and uses different agents at each:
IncidentDiagram is an open-source prototype, and we welcome contributions. There are many ways forward for the project. For example, there are more sources of information that can be used to build the diagram, such as specific commits mentioned in the postmortem document.
Whether it's improving the parsing capabilities, enhancing diagram aesthetics, or integrating with other tools, your input is valuable. You can find the project on GitHub: Rootly-AI-Labs/IncidentDiagram.