Complete Observability of IoT Systems | Datadog
CASE STUDY

Complete Observability of IoT Systems

Learn how Datadog’s IoT Agent helped Automotus gain visibility into edge devices and cloud services

About Automotus

Automotus is a curb management company that leverages computer vision software running on edge devices to help cities, fleets, and businesses increase revenue while making streets less congested, more sustainable, and more equitable.


Key Results

3x Increase

in firmware release cadence.

50% Reduction

in troubleshooting time for edge devices.

100% Increase

in the number of devices in production.


Challenge

Automotus needed a robust monitoring solution that would provide visibility into their IoT devices, as well as their scaling cloud resources.


Why Datadog?

Datadog’s unified platform gave Automotus complete visibility into their environment from edge to cloud, deepening their understanding of the health and performance of each piece of their system.


Automotus enables cities and businesses to analyze curbside activity, such as rideshare utilization and parking tickets, in order to increase revenue while minimizing traffic and congestion. The core of Automotus’s business relies on IoT devices, and if those devices fail, their business grinds to a halt. It’s therefore essential for Automotus to have total visibility into the health and performance of these devices, as well as the downstream applications and services that support them.

Harris Lummis, CTO and Co-founder of Automotus, recognized this need and began the search for a comprehensive monitoring solution for Automotus’s growing IoT system.

Limitations of Manual Troubleshooting

Before he found Datadog, Lummis was concerned about Automotus’s manual and reactive approach to monitoring. Their team was unable to collect important hardware metrics, such as network throughput, I/O load, and memory, which meant they often missed the first signs of degraded device performance. If their devices stopped sending messages, they were forced to SSH into the system and sort through logs by hand, which was an extremely time-consuming process that required all hands on deck. They also didn’t have visibility into the management and backend services that are crucial to their system, such as AWS IoT Core.

These problems were compounded by the absence of a centralized platform to view and analyze this data in context. The resulting blind spots stymied their troubleshooting process, leaving them to cross their fingers that nothing would go wrong.

Naturally, Lummis began to wonder: If troubleshooting was this hard while the product was still in alpha, how would they manage when it actually launched?

Discovering Datadog

In order to jumpstart their search for an appropriate monitoring solution, Automotus decided to contact Trek10, a Datadog Gold tier partner and an AWS-focused service provider with specialized expertise in next generation infrastructure. It was through their conversations with Trek10 that Automotus first realized how Datadog’s minimal overhead, low-maintenance approach to IoT monitoring could transform their engineering organization.

Datadog’s IoT Agent, together with IoT integrations with technologies such as AWS IoT Core, would allow them to view critical performance data from their NVIDIA Jetson-based devices alongside metrics, traces, and logs from their entire IoT ecosystem. Additionally, Datadog’s unified platform would allow them to eliminate blind spots and streamline their troubleshooting process, freeing up their engineers to spend more time building and enhancing their core product.

“ We searched long and hard for an IoT monitoring solution that was specifically built for resource-constrained environments. Datadog’s IoT Agent collects real-time metrics and logs from our devices without adding overhead, so our engineering team can get critical visibility into every deployment.”

Harris Lummis
Founder and CTO, Automotus

Full Visibility into IoT Ecosystems

With the help of Trek10, Automotus was able to onboard and ramp up with Datadog in no time. Trek10 deployed the Datadog IoT Agent to Automotus’s devices and enabled the integrations for the AWS services they rely on, giving them unprecedented visibility into their entire system. When their first outage occurred, it became clear how important Datadog would be in establishing scalable incident response best practices that would serve them for years to come.

Since then, Datadog has provided a holistic monitoring platform for Automotus, which includes:

  • Robust Integrations with IoT Technologies:
    Datadog’s integrations with key IoT technologies, such as AWS IoT Core, PostgreSQL, and RabbitMQ, enabled Automotus to get immediate visibility into the services that are crucial to their system. These integrations come equipped with out-of-the-box dashboards, where Automotus engineers can see key connectivity, messaging, and error metrics at a glance. If they notice concerning activity, such as an anomalous spike in queued messages or a sudden dip in active connections, they can pivot seamlessly to other parts of the platform, such as the Log Explorer and Infrastructure Monitoring, in order to investigate the issue further.

  • Visibility into Devices Living at the Edge:
    Trek10 helped Automotus build custom dashboards for device-level data collected by the IoT Agent. By viewing crucial system and performance metrics alongside logs and events, they are now able to get a high-level overview of device health and activity across their fleet. They can also use standardized tags such as ‘location’ to filter their data and isolate problems with specific subsets of devices, whether they’re running in California or Italy.

  • An Intelligent Alerting Infrastructure:
    IoT systems generate enormous volumes of data, which is often transmitted over unreliable network connections. This makes it difficult to implement an alerting infrastructure that doesn’t generate false positives or let real issues fall through the cracks. With Datadog, Automotus could set up sophisticated alerts for their entire fleet of devices, as well as their downstream services, ensuring that they only receive alerts for sustained, legitimate failures. They can also leverage machine learning-based features such as anomaly and outlier detection to automatically surface unusual activity that they may not have anticipated.

A Platform that Scales with You

With Datadog, Automotus was able to increase their release velocity while scaling up their entire IoT fleet. Automotus currently deploys new firmware three times a week, and Datadog Deployment Tracking enables them to monitor the performance of each deployment in real time. Now, Automotus can be confident in the knowledge that they have the tools they need to move their business forward as quickly as possible—without any maintenance or visibility tradeoffs.

“ We’ve been thrilled with the performance of the Datadog IoT Agent—it has lifted a huge burden from our team’s shoulders. Datadog gives us a single source of truth for diagnosing errors and even helps us proactively identify issues.”

Harris Lummis
Founder and CTO, Automotus

Resources

gated-asset/Azure_IoT_WhitePaper_GatedAssets_426x212

guide

Monitoring Azure IoT White Paper
gated-asset/AWS_gatedads_cloudmonitoring_thumbnail

ebook

Cloud-scale monitoring with AWS
/blog/datadog-iot-agent/iot-agent-hero

BLOG

Introducing the Datadog IoT Agent