Top SRE AI tools

Onepane is a GenAI solution for IT Managers, DevOps, and SREs, offering unified insights and control over cloud resources to accelerate incident resolution and optimize operations.
- Freemium
- From $500

NeuBird Hawkeye is an AI-powered SRE agent designed to dramatically reduce MTTR and transform IT operations. It analyzes complex IT issues instantly, enabling problem resolution in minutes.
- Contact for Pricing

Wild Moose is an AI-powered SRE copilot that provides fast, efficient root cause analysis, improving with every incident to end downtime before it starts.
- Paid
- From $800

SmythOS is a no-code platform that enables rapid AI agent creation and deployment. It offers a visual builder, universal API integration, and pre-built agent templates for various use cases.
- Freemium

Botkube is a Kubernetes troubleshooting platform that provides alerts, investigation tools, and remediation steps directly within your chat platform. It helps DevOps teams quickly resolve Kubernetes issues.
- Paid
- From $10

Cleric is an autonomous AI site reliability engineer that identifies the root cause of alerts from production applications without requiring runbooks. It frees on-call engineers from time-consuming investigations.
- Contact for Pricing

Blameless is an incident management platform utilizing automation and AI to help engineering teams streamline response, improve communication, and enhance system reliability.
- Free Trial
- From $30

K8sGPT is a tool that scans Kubernetes clusters, then diagnoses and triages issues in plain English. It leverages AI to enrich analysis and provide actionable insights.
- Free

KubeHA automates Kubernetes alert analysis and remediation, leveraging GenAI to streamline recovery and improve operational efficiency. It reduces downtime and enhances system reliability.
- Free Trial

Resolvd automates IT and cloud tasks with AI-powered scripts and agents, streamlining workflows like onboarding, provisioning, and access management.
- Contact for Pricing

Parity is an AI-powered SRE platform that provides automated incident response and investigation for Kubernetes clusters, reducing MTTR and improving on-call experience.
- Paid
- From $250

Squadcast is a reliability automation platform designed to streamline incident response, reduce downtime, and enhance team delivery by unifying on-call and incident management workflows. It leverages AI for continuous learning and improved system reliability.
- Freemium
- From $12