Gartner names Komodor key vendor in AI SRE tooling
Komodor has been named a Representative Vendor in Gartner's Market Guide for AI Site Reliability Engineering (SRE) Tooling, as analysts predict rapid adoption of AI-led operational tools across large organisations.
Gartner argues that conventional SRE and operations teams are struggling with rising complexity, cost pressure, and reliability requirements. It forecasts that 85% of enterprises will use AI SRE tooling by 2029, up from less than 5% in 2025. The shift reflects growing interest in software that automates parts of incident response and infrastructure management in cloud-native environments.
AI SRE tooling sits at the intersection of observability, incident management, and automation. Vendors in this area analyse telemetry and event data, identify likely root causes, and recommend or execute remediation steps. The tools are also increasingly focused on prevention, including early warnings of performance degradation, anomaly detection, and configuration changes that increase outage risk.
The report also offers guidance for organisations adopting these products. It recommends augmenting existing SRE and operations teams with AI tooling, rather than rebuilding organisational structures. Gartner also highlights the use of telemetry, event correlation, and root cause analysis to support reliability-focused design and delivery workflows.
Komodor's platform
Komodor markets an AI SRE platform for cloud-native infrastructure and operations, used by platform engineering, DevOps, and SRE teams. It automates parts of troubleshooting and remediation, and analyses resource utilisation for cost management.
A central element is an autonomous AI agent called Klaudia. Komodor says it correlates telemetry across the cloud-native stack, analyses incidents, and helps prevent outages and performance degradation.
In practice, tools in this category typically ingest signals from logging, metrics, and tracing systems, then relate them to application deployments and infrastructure changes. The goal is to reduce the time engineers spend assembling incident timelines and to narrow likely causes across clusters, services, and add-on components common in Kubernetes-based environments.
Komodor also emphasises "explainable" root cause analysis. Explainability has become increasingly important as operations teams weigh whether to trust automated remediation. Many organisations want systems that can show supporting evidence for conclusions and actions, particularly when changes affect production systems.
Market pressures
Enterprises have become more reliant on distributed applications and managed cloud services, raising expectations for uptime and consistent user experience. It has also increased the number of components involved in delivering each customer-facing service, making fault isolation more difficult.
Organisations are also under pressure to manage cloud spending. FinOps practices have expanded, but cost optimisation can conflict with reliability when teams reduce redundancy or scale down resources. AI SRE vendors increasingly position their products to manage both by linking performance signals to infrastructure usage patterns.
Skills shortages add to the challenge. Experienced SRE and platform engineers remain in high demand, and many organisations struggle to staff 24/7 on-call rotas. Tools that triage alerts and reduce manual investigation are being adopted to reduce burnout and ensure consistent incident response.
What Gartner signals
Gartner's forecast suggests AI SRE tooling will move from early adoption to mainstream use in large organisations over the next few years. A similar pattern played out with observability platforms and incident management systems, which began as specialist tools and became standard procurement items as systems grew more distributed.
Broad adoption will depend on integration with existing monitoring and ticketing stacks and on governance of automated actions. Many organisations are likely to start with AI-led analysis and recommendations, then expand to automated remediation after validating accuracy and safety controls.
Komodor's inclusion in the Market Guide places it among vendors targeting this shift. Komodor says it works with large enterprises across several sectors, including financial services, healthcare, and retail. It has raised USD $90 million in venture funding, according to the company.
"Reliability has become a core requirement for modern, cloud-native systems, but many organizations are still constrained by cost, complexity, and skills gaps," said Ben Ofiri, CEO of Komodor. "We believe Gartner's inclusion of Komodor as a Representative Vendor reflects the growing need for AI-driven approaches that help teams move beyond reactive incident response toward proactive reliability, without requiring a complete reorganization or massive upfront investment."