Monitoring Elasticsearch

Monitoring Elasticsearch

By : Dan Noble, Pulkit Agrawal, Mahmoud Lababidi

Buy this Book

Monitoring Elasticsearch

By: Dan Noble, Pulkit Agrawal, Mahmoud Lababidi

Buy this Book

Overview of this book

ElasticSearch is a distributed search server similar to Apache Solr with a focus on large datasets, a schema-less setup, and high availability. This schema-free architecture allows ElasticSearch to index and search unstructured content, making it perfectly suited for both small projects and large big data warehouses with petabytes of unstructured data. This book is your toolkit to teach you how to keep your cluster in good health, and show you how to diagnose and treat unexpected issues along the way. You will start by getting introduced to ElasticSearch, and look at some common performance issues that pop up when using the system. You will then see how to install and configure ElasticSearch and the ElasticSearch monitoring plugins. Then, you will proceed to install and use the Marvel dashboard to monitor ElasticSearch. You will find out how to troubleshoot some of the common performance and reliability issues that come up when using ElasticSearch. Finally, you will analyze your cluster’s historical performance, and get to know how to get to the bottom of and recover from system failures. This book will guide you through several monitoring tools, and utilizes real-world cases and dilemmas faced when using ElasticSearch, showing you how to solve them simply, quickly, and cleanly.

Monitoring Elasticsearch

Credits

About the Author

About the Reviewers

www.PacktPub.com

Preface

Free Chapter

Introduction to Monitoring Elasticsearch

An overview of Elasticsearch

Monitoring Elasticsearch

Resourcefulness and problem solving

Summary

Installation and the Requirements for Elasticsearch

Installing Elasticsearch

Configuring an Elasticsearch cluster

Understanding your cluster

Cluster requirements

Summary

Elasticsearch-head and Bigdesk

Cluster setup

Elasticsearch-head

Bigdesk

The Elasticsearch cat API

Summary

Marvel Dashboard

Setting up Marvel

Upgrading Marvel

Configuring Marvel

Marvel index configuration

Understanding the Marvel dashboard

Monitoring node failures

Summary

System Monitoring

Working with Kopf

Working with Logstash and Kibana

Working with Nagios

Command line tools for system and process management

Summary

Troubleshooting Performance and Reliability Issues

System configuration

The fielddata cache

Analyzing queries

Improving query performance

System and data architecting

Case studies

Summary

Node Failure and Post-Mortem Analysis

Diagnosing problems

Reviewing some case studies

Summary

Looking Forward

Elasticsearch 5 overview

Upgrading to Elasticsearch 5.0

Monitoring Elasticsearch 5

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Monitoring Elasticsearch

Monitoring distributed systems is difficult because as the number of nodes, the number of users, and the amount of data increase, problems will begin to crop up.

Furthermore, it may not be immediately obvious if there is an error. Often, the cluster will keep running and try to recover from the error automatically. As shown in Figures 1.2, 1.3, and 1.4 earlier, a node failed, but Elasticsearch brought itself back to a green state without any action on our part. Unless monitored, failures such as these can go unnoticed. This can have a detrimental impact on system performance and reliability. Fewer nodes means less processing power to respond to queries, and, as in the previous example, if another node fails, our cluster won't be able to return to a green state.

The aspects of an Elasticsearch cluster that we'll want to keep track of include the following:

Cluster health and data availability
Node failures
Elasticsearch JVM memory usage
Elasticsearch cache size
System utilization (CPU, Memory, and Disk)
Query response times
Query rate
Data index times
Data index rate
Number of indices and shards
Index and shard size
System configuration

In this book, we'll go over how to understand each of these variables in context and how understanding them can help diagnose, recover from, and prevent problems in our cluster. It's certainly not possible to preemptively stop all Elasticsearch errors. However, by proactively monitoring our cluster, we'll have a good idea of when things are awry and will be better positioned to take corrective action.

In the following chapters, we'll go over everything from web-based cluster monitoring tools to Unix command line tools and log file monitoring. Some of the specific tools this book covers are as follows:

Elasticsearch-head
Bigdesk
Marvel
Kopf
Kibana
Nagios
Unix command-line tools

These tools will give us the information we need to effectively diagnose, solve, and prevent problems with Elasticsearch.

Monitoring Elasticsearch

By : Dan Noble, Pulkit Agrawal, Mahmoud Lababidi

Monitoring Elasticsearch

By: Dan Noble, Pulkit Agrawal, Mahmoud Lababidi

Overview of this book

Related Content you might be interested in

Current Title:

Monitoring Elasticsearch

Monitoring Elasticsearch