
Detecting hallucinations with LLM-as-a-judge: Prompt engineering and beyond
Discover how Datadog uses LLM-as-a-judge, structured output, and prompt engineering to detect hallucinations in RAG-based applications—at scale and in real time.
Learn how we developed Datadog Automatic Faulty Deployment Detection and improved precision, recall, and time to detection along the way.
Explore Toto, Datadog’s open source time series foundation model (TSFM), and BOOM, a new benchmark for observability metrics. Both are open source under the Apache 2.0 license and deliver state-of-the-art forecasting performance on real-world data.