Datadog Application Observability

Troubleshoot, optimize, and secure your applications faster with end-to-end distributed tracing and service-centric observability at scale, correlated with all telemetry types.

Why Datadog?

End-to-End Distributed Tracing

Set up in minutes, and instantly visualize your traces with full control over ingestion, retention, and costs


Code Hotspots and Thread-Level Insights

Optimize the most resource-consuming lines of production code with our low-overhead Continuous Profiler


Automatic Deployment Tracking

Request, error, and latency metrics for every code deploy, including out-of-the-box code version comparisons


Superior Granularity & Retention

0-second OOTB high resolution, 1-second availability for critical metrics, 15-month retention, and no roll-up, ever


Product Benefits

Find Root Causes Faster with Thread-Level Distributed Tracing

  • Easily pin bottlenecks down to the method or line of code, including slow I/O, lock contention, and inefficient garbage collection, with full-stack distributed tracing and thread-level continuous profiling
  • Identify root causes quickly with traces automatically correlated with your logs, infrastructure metrics, database queries, network calls, frontend telemetry and more—all in one view
  • Monitor OTel-instrumented apps with support for OTel API and Collector within the Datadog Agent for full interoperability

Get Live Visibility and Complete Control Over Traces

  • Search and analyze your ingested traces live over the last 15 minutes
  • Filter traces based on trace-level attributes, service relationships, endpoints, and other properties, all without needing to learn a complex query language
  • Retain errors and high-latency traces automatically for 15 days
  • Control cost-visibility tradeoffs with fine-grained ingestion controls and tag-based retention filters
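
The retention rules in the list above can be pictured as a simple decision function. The sketch below is illustrative only and is not Datadog's implementation or API: the trace dictionary layout, the `should_retain` and `matches_filter` names, and the 1000 ms latency threshold are all assumptions made for the example.

```python
def matches_filter(tags, required):
    """Return True when every key/value pair in `required` appears in `tags`."""
    return all(tags.get(key) == value for key, value in required.items())


def should_retain(trace, tag_filters, latency_threshold_ms=1000.0):
    """Decide whether a trace is kept (hypothetical rule set).

    `trace` is assumed to be a dict with the keys "has_error" (bool),
    "duration_ms" (float) and "tags" (dict of str to str).
    """
    # Errors and slow traces are always kept.
    if trace["has_error"] or trace["duration_ms"] >= latency_threshold_ms:
        return True
    # Otherwise keep the trace only if some tag-based filter matches it.
    return any(matches_filter(trace["tags"], f) for f in tag_filters)


if __name__ == "__main__":
    filters = [{"env": "prod", "service": "checkout"}]
    fast_ok = {"has_error": False, "duration_ms": 20.0,
               "tags": {"env": "staging", "service": "checkout"}}
    print(should_retain(fast_ok, filters))
```

Keeping the always-retain rules separate from the tag filters mirrors the tradeoff described above: errors and slow requests stay visible, while the filters decide how much of the remaining traffic is worth paying to keep.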

Instantly Generate Logs, Spans, Metrics, and Span Tags without Changing Code

  • Debug production issues faster by adding log statements without changing code, gaining granular insight into application behavior and service interactions
  • Add spans to troubleshoot slow requests and specific operations in your application without leaving the Datadog platform
  • Create metrics on the fly that measure how much time any method in your code consumes in production, and use metric expressions to focus on specific requests

Receive Alerts Only for the Issues that Matter and Eliminate False Positives

  • Set up recommended alerts with one click for anomalies and outliers that account for daily, weekly, and seasonal fluctuations
  • Prevent future outages and errors by alerting on metric forecasts
  • Combine alerts into composite alerts for greater granularity and stronger signal to reduce the noise
  • Automatically detect unanticipated outliers, anomalies, and errors with Watchdog

Centralize Your Service Knowledge and Operations

  • Achieve end-to-end service ownership at scale, get real-time performance insights, detect and address reliability and security risks, and manage application dependencies all in one place
  • Get RED (rate, errors, duration) metrics based on 100% of traffic with 15-month retention so you can search, analyze, and visualize any trace using any tag
  • Automatically discover, catalog, and monitor services—no instrumentation code changes necessary—with Universal Service Monitoring
  • Reduce mean time to detection through automatic dependency mapping, powered by eBPF technology
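
As a rough illustration of the RED metrics mentioned above (request rate, error rate, and duration), the sketch below computes them per service from a batch of spans. It is a minimal, self-contained example and not Datadog's implementation: the `Span` type, the `red_metrics` function, and the choice of a nearest-rank 95th percentile for duration are assumptions made for the example.

```python
import math
from dataclasses import dataclass


@dataclass
class Span:
    service: str
    duration_ms: float
    is_error: bool


def red_metrics(spans, window_seconds):
    """Compute rate, error rate, and p95 duration per service.

    `spans` are assumed to have been collected over `window_seconds`.
    """
    by_service = {}
    for span in spans:
        by_service.setdefault(span.service, []).append(span)

    result = {}
    for service, group in by_service.items():
        durations = sorted(s.duration_ms for s in group)
        # Nearest-rank percentile: smallest value with at least 95% of samples at or below it.
        rank = max(1, math.ceil(0.95 * len(durations)))
        errors = sum(1 for s in group if s.is_error)
        result[service] = {
            "requests_per_second": len(group) / window_seconds,
            "error_rate": errors / len(group),
            "p95_duration_ms": durations[rank - 1],
        }
    return result


if __name__ == "__main__":
    sample = [
        Span("checkout", 100.0, False),
        Span("checkout", 200.0, True),
        Span("checkout", 300.0, False),
        Span("checkout", 400.0, False),
    ]
    print(red_metrics(sample, window_seconds=2.0))
```

Computing these values from every request rather than from a sample is what makes the numbers trustworthy for low-traffic endpoints, where a sampled error rate can swing widely.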

Full-Stack Defense Across Apps, Workloads, and Infrastructure

  • Track your security posture easily with out-of-the-box threat activity, exposure, and vulnerabilities ratings captured in the Datadog Severity Score
  • Triage vulnerability impact in full context with continuous runtime scans across open source libraries
  • Remediate issues with out-of-the-box actionable guidance and automatic correlation between your application and infrastructure
  • Quickly discover code vulnerabilities and attack attempts in your Java, .NET, PHP, Node.js, Ruby, Python, Go, and C++ applications

Thread-Level Insights into Performance Bottlenecks in Production

  • Reduce your production code’s latency and resource consumption with continuous code profiling
  • Identify inefficiencies and apply suggested fixes to optimize application code, sidestepping lengthy reproduction processes with automated insights
  • Leverage a detailed chronological visualization of code and runtime activity—grouped by threads, fibers, goroutines, or event loops—filtered to a container or a specific trace to diagnose performance issues

Ensure Smooth Deployments and Eliminate Performance Regressions

  • Follow issues over time with Error Tracking to know when they first started, whether they are still ongoing, and how often they occur
  • Compare application performance and impact across hosts, versions, and time ranges during rolling, canary, blue/green, or shadow deploys
  • Catch faulty deployments quickly with automatic detection, then decide whether to roll back or ship a fix

The Essential Monitoring and Security Platform for the Cloud Age

Datadog brings together end-to-end traces, metrics, and logs to make your applications, infrastructure, and third-party services entirely observable.

Loved & Trusted by Thousands

Washington Post, 21st Century Fox Home Entertainment, Peloton, Samsung, Comcast, Nginx