Book Image

Datadog Cloud Monitoring Quick Start Guide

By : Thomas Kurian Theakanath
Book Image

Datadog Cloud Monitoring Quick Start Guide

By: Thomas Kurian Theakanath

Overview of this book

Datadog is an essential cloud monitoring and operational analytics tool which enables the monitoring of servers, virtual machines, containers, databases, third-party tools, and application services. IT and DevOps teams can easily leverage Datadog to monitor infrastructure and cloud services, and this book will show you how. The book starts by describing basic monitoring concepts and types of monitoring that are rolled out in a large-scale IT production engineering environment. Moving on, the book covers how standard monitoring features are implemented on the Datadog platform and how they can be rolled out in a real-world production environment. As you advance, you'll discover how Datadog is integrated with popular software components that are used to build cloud platforms. The book also provides details on how to use monitoring standards such as Java Management Extensions (JMX) and StatsD to extend the Datadog platform. Finally, you'll get to grips with monitoring fundamentals, learn how monitoring can be rolled out using Datadog proactively, and find out how to extend and customize the Datadog platform. By the end of this Datadog book, you will have gained the skills needed to monitor your cloud infrastructure and the software applications running on it using Datadog.
Table of Contents (19 chapters)
Section 1: Getting Started with Datadog
Section 2: Extending Datadog
Section 3: Advanced Monitoring

Understanding metrics in Datadog

The health of a software system and the infrastructure it is running on are defined by a set of metrics and their threshold values. For example, on the infrastructure side, if the CPU usage on a machine is under 70%, it might be considered healthy for a specific use case. When all such metrics that are used for monitoring an environment report values in the normal range, the entire environment can be considered healthy. By setting relevant thresholds for these metrics on monitors, issues can be reported as alerts. Datadog provides features to define metrics-based monitors and alerts.

We saw in Chapter 2, Deploying Datadog Agent, and Chapter 3, Datadog Dashboard, that published metrics can be viewed and filtered using tags in Metrics Explorer in the Datadog UI, as in the following example:

  1. Navigate to Metrics | Metrics Explorer to bring up the Metrics Explorer window:

    Figure 5.1 – Metrics Explorer

  2. In the Graph field, enter the name...