

Benchmarking LLMs for SRE tasks, boosting Sonnet 4.5 performance by 100%
The new edition of our benchmark features Terraform tasks across AWS, GCP, and Azure, and adds a new dimension: prompt optimization.
July 2, 2025
5 mins
Reliability engineering is evolving quickly—and AI is the catalyst. That’s why we’re excited to unveil Rootly AI Labs, a community-focused program dedicated to reshaping reliability through open collaboration, innovative prototypes, and cutting-edge research.
We kicked things off with an exclusive event at GitHub’s San Francisco headquarters. Guests explored interactive demos and heard insights from leaders at Google, Anthropic, Andreessen Horowitz, Browserbase, Sentry, GitHub, Postman, and Rootly.
A key theme was the latest progress in Model Context Protocol (MCP) servers and Agent-to-Agent (A2A) protocols, technologies that help developers and SRE teams boost productivity and chase the elusive “six nines” of uptime.
Expert panel (moderated by Sylvain Kalache, Head of Rootly AI Labs)
Rootly AI Labs is a collaborative hub designed for reliability engineers and researchers to develop transformative AI-based open-source tools, innovative prototypes, and cutting-edge research papers. All outputs are freely accessible, aiming to rapidly improve industry-wide operational excellence.
All projects are open-source (Apache 2.0) and free to use.
Backed by Anthropic and supported by top engineers from LinkedIn, Venmo, Twilio, and esteemed research universities such as Carnegie Mellon, Georgia Tech, and McGill, Rootly AI Labs is looking for new contributors and partners.
Visit our GitHub page to explore every project, watch creator interviews, and submit pull requests. Your feedback and ideas are always welcome.
Together, we’re redefining what’s possible in reliability engineering. Join us and be part of the future.