Book Image

The DevOps 2.2 Toolkit

By : Viktor Farcic
Book Image

The DevOps 2.2 Toolkit

By: Viktor Farcic

Overview of this book

Building on The DevOps 2.0 Toolkit and The DevOps 2.1 Toolkit: Docker Swarm, Viktor Farcic brings his latest exploration of the Docker technology as he records his journey to explore two new programs, self-adaptive and self-healing systems within Docker. The DevOps 2.2 Toolkit: Self-Sufficient Docker Clusters is the latest book in Viktor Farcic’s series that helps you build a full DevOps Toolkit. This book in the series looks at Docker, the tool designed to make it easier in the creation and running of applications using containers. In this latest entry, Viktor combines theory with a hands-on approach to guide you through the process of creating self-adaptive and self-healing systems. Within this book, Viktor will cover a wide-range of emerging topics, including what exactly self-adaptive and self-healing systems are, how to choose a solution for metrics storage and query, the creation of cluster-wide alerts and what a successful self-sufficient system blueprint looks like. Work with Viktor and dive into the creation of self-adaptive and self-healing systems within Docker.
Table of Contents (18 chapters)

Alerting Humans

While Prometheus alerts are great by themselves, they are not very useful unless you're planning to spend all your time in front of the alerts screen. There are much better things to stare. For example, you can watch Netflix instead. It is much more entertaining than watching Prometheus screen. However, before you start watching Netflix during your working hours, we need to find a way so that you do get notified when an alert is fired.

Before we proceed, I must stress that alerts to humans (operators and sysadmins) are the last resort. We should receive them only if the system was not capable of fixing the problem. However, in the beginning, we do not have a self-healing system. The approach we'll take is to send each alert to a human. That's a quick fix. From there on, we'll try to build the system that will receive those alerts instead us...