TLDR: Observe Inc. has launched two new AI agents, AI SRE and o11y.ai, designed to enhance engineering productivity, accelerate incident resolution, and significantly cut enterprise observability costs. These agents leverage an open data lake architecture and a knowledge graph to provide intelligent incident investigation, remediation, and streamlined code delivery.
San Mateo, California – November 4, 2025 – Observe Inc., a leader in AI-powered observability solutions, today announced the immediate availability of its new AI SRE and o11y.ai agents. These innovative tools are built upon Observe’s open data lake architecture and proprietary knowledge graph, aiming to revolutionize reliability engineering by accelerating developer productivity and substantially reducing enterprise observability expenses.
Key Innovations and Benefits:
Observe’s new AI agents are engineered to address critical challenges in modern observability, particularly the increasing complexity and cost of managing vast telemetry data. Early customer feedback points to the following results:
- Incident triage is up to 10 times faster.
- Mean time to resolution (MTTR) has been reduced from hours to minutes.
- Overall observability costs are cut by up to 60%.
Jeremy Burton, CEO of Observe Inc., emphasized the shifting bottleneck in software delivery: “As AI code generation accelerates software delivery, the bottleneck has shifted to running and maintaining systems reliably at scale. AI SRE and o11y.ai directly address these pain points by making systems observable, reliable, and affordable from day one.”
AI SRE Agent: Automating Incident Response
The AI SRE agent is designed to automate and streamline incident response workflows. It autonomously applies context, identifies root causes, and proposes fixes, enabling engineering teams to troubleshoot incidents more rapidly and efficiently at scale. By leveraging a real-time contextual understanding of logs, metrics, and traces, AI SRE reduces operational toil, minimizes on-call burdens, and improves the accuracy of root cause identification. Its foundation on Observe’s scalable data lake architecture also facilitates longer data retention while achieving significant cost reductions.
Furthermore, the AI SRE agent offers extensive customization and extensibility through its Model Context Protocol (MCP) Server. This server integrates natively with leading AI tools such as Claude Code, OpenAI Codex, Augment Code, Windsurf, and n8n. The MCP Server utilizes Observe’s knowledge graph to enrich agents with context from massive volumes of observability data, leading to higher accuracy in incident resolution. This allows teams to integrate proprietary data, add custom context, automate complex workflows, and build bespoke AI agents tailored to their specific enterprise environments. Engineers can interact with their observability data in natural language directly within their code editors, eliminating the need to switch between multiple tools and query languages.
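For readers unfamiliar with the Model Context Protocol, the sketch below shows the general shape of a request an AI tool sends to an MCP server. MCP messages are JSON-RPC 2.0, and tools are invoked through the `tools/call` method. The tool name `query_logs` and its `question` argument are hypothetical placeholders chosen for illustration; the announcement does not list the tools that Observe's MCP Server actually exposes.

```python
import json
from itertools import count

# Monotonic request ids, as JSON-RPC 2.0 expects each request to carry an id.
_ids = count(1)


def build_tool_call(tool_name, arguments):
    """Return a JSON-RPC 2.0 request dict for MCP's `tools/call` method."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool name and argument, not taken from Observe's documentation.
request = build_tool_call(
    "query_logs",
    {"question": "Which services logged 5xx errors in the last hour?"},
)
print(json.dumps(request, indent=2))
```

In practice a code editor or agent framework builds and sends such messages on the engineer's behalf, which is what allows a natural language question to stand in for a hand-written query.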
Brian Schneider, Senior Director of Engineering, Venue Systems, Topgolf, commented on the potential: “At Topgolf, we see tremendous opportunity for AI to streamline how our engineers interact with observability data directly within their coding environments. Tools like Observe’s MCP Server help increase developer velocity and reinforce our commitment to delivering reliable, high-quality Player experiences across our venues worldwide. It’s also a major unlock for our SRE practice, empowering teams to proactively analyze systems and address potential issues before they impact players.”
o11y.ai Agent: Developer-Centric Observability
The o11y.ai agent is tailored for developers and makes observability an integral part of the coding process. It enables developers to generate code instrumentation, debug their applications, and query application telemetry directly. The agent automatically adds OpenTelemetry instrumentation, giving engineers immediate access to essential logs, metrics, and traces. Developers can ask natural language questions about application usage, errors, and performance, and can debug and validate fixes using contextual information from their telemetry and code.
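To give a rough idea of what "adding instrumentation" means, the sketch below wraps a function so that each call records a name, a duration, and an error status, which is the kind of information a trace span carries. It is a standard-library stand-in for illustration only: it does not use the OpenTelemetry SDK, it is not code generated by o11y.ai, and the names `traced`, `RECORDED_SPANS`, and `order_total` are hypothetical.

```python
import functools
import time

# Stand-in for a telemetry exporter; a real setup would ship spans to a backend.
RECORDED_SPANS = []


def traced(name):
    """Decorator that records a span-like dict for every call to the function."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            span = {"name": name, "status": "ok", "error": None}
            start = time.perf_counter()
            try:
                return func(*args, **kwargs)
            except Exception as exc:
                # Mark the span as failed, then let the error propagate.
                span["status"] = "error"
                span["error"] = repr(exc)
                raise
            finally:
                span["duration_ms"] = (time.perf_counter() - start) * 1000.0
                RECORDED_SPANS.append(span)
        return wrapper
    return decorator


@traced("checkout.total")  # hypothetical span name
def order_total(prices):
    return sum(prices)


order_total([5, 7])
print(RECORDED_SPANS[-1]["name"], RECORDED_SPANS[-1]["status"])
```

With real OpenTelemetry instrumentation the wrapper would create spans through the OpenTelemetry tracing API, but the effect on application code is similar: business logic stays unchanged while each call emits telemetry.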
Impact on Engineering Velocity:
With o11y.ai, customers are seeing shorter feedback loops, faster root cause analysis, and higher overall engineering velocity.
Industry Context and Compliance:
The launch aligns with industry trends highlighted in the 2025 Gartner® Infrastructure and Operations (I&O) Signature Role Survey, which found that 54% of CIOs prioritize improving operational resilience and 73% of I&O heads focus on cost optimization. Observe’s agents are designed to meet these goals affordably. The platform also includes built-in governance and compliance features, such as role-based access controls and support for SOC 2 Type II, ISO 27001, and GDPR.
Adam Skobodzinski, Software Engineer at Foursquare, noted, “As data volumes grow, we’re excited about how AI-driven observability could help engineers connect the dots faster across logs, metrics, and traces to provide reliable services to users of our geospatial technology platform. Observe’s AI SRE and MCP Server have the potential to transform how we investigate incidents and reduce the time engineers spend on resolving issues by providing faster, more contextual insight into system behavior.”
Observe Inc. is headquartered in San Mateo, California, and continues to redefine observability for the AI era, offering proactive, intelligent, and cost-efficient solutions at scale.


