Hands-On Infrastructure Monitoring with Prometheus

By: Joel Bastos, Pedro Araújo

Overview of this book

Prometheus is an open-source monitoring system. It provides a modern time series database, a robust query language, several options for visualizing metrics, and a reliable alerting solution for traditional and cloud-native infrastructure. This book covers the fundamental concepts of monitoring and explores the Prometheus architecture, its data model, and how metric aggregation works. Multiple test environments are included to help you explore different configuration scenarios, such as the use of various exporters and integrations. You'll delve into PromQL, supported by several examples, and then apply that knowledge to alerting and recording rules, as well as to testing them. After that, alert routing with Alertmanager and creating visualizations with Grafana are thoroughly covered. In addition, this book covers several service discovery mechanisms and even provides an example of how to create your own. Finally, you'll learn about Prometheus federation, cross-sharding aggregation, and long-term storage with the help of Thanos. By the end of this book, you'll be able to implement and scale Prometheus as a full monitoring system on-premises, in cloud environments, in standalone instances, or using container orchestration with Kubernetes.
Table of Contents (21 chapters)
Section 1: Introduction
Section 2: Getting Started with Prometheus
Section 3: Dashboards and Alerts
Section 4: Scalability, Resilience, and Maintainability

Introduction to the book and the technology

This book about Prometheus, the second project to graduate within the Cloud Native Computing Foundation (CNCF), will help you solidify the fundamentals of monitoring and the approaches available for gaining the infrastructure visibility you need. It relies on practical examples, using test environments and diagrams, to communicate knowledge in an easy-to-digest manner.

The content was designed to ensure that all the important Prometheus stack concepts are tackled. Our main goal while writing was to aim the book at our past selves and give them everything they would have needed to know about this technology.

The topics range from running a single Prometheus server to the scaling options available, from creating and testing alerting rules to templating Slack notifications, and from building useful dashboards to automating target discovery. These and many other topics are explained to provide a full knowledge base on infrastructure monitoring with Prometheus as its cornerstone.