SecurityBrief Canada - Technology news for CISOs & cybersecurity decision-makers
Databricks

Datadog launches Bits AI SRE to automate incident resolution

Wed, 3rd Dec 2025

Datadog has introduced an artificial intelligence-driven agent designed to streamline the process of incident response for engineering teams. The new agent, called Bits AI SRE, automatically investigates technical alerts, determines probable root causes, and shares recommended actions within minutes.

Incident management

Rapid incident response remains central to minimising disruption in digitally-enabled businesses. Delays in identifying and resolving technical issues can result in lost customer trust and financial impact. Bits AI SRE aims to address this by autonomously monitoring telemetry data, organisational context, and established architecture. It leverages Datadog's platform-level intelligence to deliver root cause analyses before human responders begin their tasks.

According to Datadog, Bits AI SRE analyses runbooks, logs, and system telemetry in real time, filtering out irrelevant signals and prioritising actionable information for on-call teams. The agent integrates with third-party collaboration platforms to deliver findings directly to the engineers who need them.

Reducing manual work

This approach is intended to reduce the hours engineers spend on manual troubleshooting, shifting much of the investigative burden to automation. In practice, the system validates its own conclusions and provides guidance on next steps based on available diagnostics.

Enterprise customers have reported improvements in mean time to resolution (MTTR) - a standard industry measure of how quickly technical problems are solved. Bits AI SRE has so far operated across more than 2,000 client environments, handling a range of technical alerts from routine issues to critical incidents.

Enterprise-scale security

BITS AI SRE is designed for large-scale operations. It supports role-based access controls and complies with standards such as HIPAA. Datadog notes the inclusion of enterprise contracts with recognised AI vendors, reflecting increased focus on secure deployment of artificial intelligence in production settings.

"This launch represents a pivotal expansion of Datadog's AI strategy as our first generally available AI agent, and signals a new phase of intelligent, automated reliability. Bits AI SRE allows companies to mitigate issues faster, reduce customer impact, and adopt AI safely. It has already been tested against more than 2,000 customer environments, including both global enterprises and fast-growing start-ups with a diverse range of production environments. Tens of thousands of investigations have run to date, from routine alerts to high-severity incidents, with organisations already reporting positive outcomes. This reflects the tangible and immediate value, tied directly to operational and business outcomes, that we are delivering," said Yanbing Li, Chief Product Officer, Datadog.

Customer perspectives

Users in a variety of sectors have reported a positive shift in how incidents are managed. One metric highlighted has been the reduction in cognitive load for engineers by automatically surfacing the most relevant signals during high-pressure events.

"During an incident, the first five minutes are critical. Bits AI helps us cut through the noise by instantly surfacing the right context and correlations across our systems. With smart tagging and naming, it automatically guides engineers to the right information, reducing cognitive load and giving us clarity and control when it matters most," said Thiyagarajan Anandan, Senior Engineering Manager, Uber Freight.

"With Bits AI SRE being on-call 24/7 for us, MTTR for our services have improved significantly. For most cases, the investigation is already taken care of well before our engineers sit down and open their laptops to assess the issue," said Andrew Seok Ju Kim, Data Engineer, DelightRoom.

Follow us on:
Follow us on LinkedIn Follow us on X
Share on:
Share on LinkedIn Share on X