Blog

Incident management insights, guides, and product updates from Rootly

Benchmarking LLMs for SRE-tasks, boosting Sonnet 4.5 performance by 100%

The new edition of our benchmark features Terraform tasks across AWS, GCP, and Azure, and adds a new dimension: prompt optimization.

Sylvain Kalache

October 8, 2025
10 mins
Introducing the On-Call Burnout Detector

An open source, research-based tool that looks for early-warning signs of burnout in your on-call engineers.

Sylvain Kalache

September 25, 2025
5 mins
From Hype to Hard Lessons in Agentic AI

The panel warned: the opportunity is massive, but without observability, security, and strategy, the regrets will be real.

Andre King

September 22, 2025
8 mins
SRECon EMEA 2025: Top Talks + Events

5 AI and reliability talks you can’t miss, plus the perfect after-conference events to wrap up Days 1 and 2 in Dublin

Sylvain Kalache

September 16, 2025
7 mins
The Art of Incident Management, Part I

“Art, in itself, is an attempt to bring order out of chaos.” - Stephen Sondheim

Jorge Lainfiesta

September 9, 2025
4 mins
Rootly joins Groq OpenBench with an SRE-focused benchmark

Making LLM evaluations reproducible for real-world SRE workflows

Sylvain Kalache

August 28, 2025
5 mins
How to Structure an Incident Response Team: Roles, Responsibilities, and Workflows

Learn how to structure an incident response team with defined roles, responsibilities, and workflows to reduce downtime and improve resilience.

Alexandra Chaplin

August 26, 2025
6 mins
Incident Response Process: SRE Teams Step-by-Step Guide

Discover the complete incident response process for SRE teams. From detection to postmortems, learn how to manage incidents with clarity and speed.

JP Cheung

August 26, 2025
8 mins
AI in Incident Response: How Automation Improves MTTR

Discover how AI in incident response cuts MTTR through rapid detection, automated triage, and faster resolution, boosting uptime and reliability.

Kayla Thomson

August 21, 2025
4 mins