Mastering Elasticsearch 5.x - Third Edition

Overview of this book

Elasticsearch is a modern, fast, distributed, scalable, fault-tolerant, open source search and analytics engine. It builds on the capabilities of Apache Lucene and provides a new level of control over how you index and search even huge sets of data. This book gives you a brief recap of the basics and introduces the new features of Elasticsearch 5. We will guide you through the intermediate and advanced functionalities of Elasticsearch, such as querying, indexing, searching, and modifying data. We’ll also explore advanced concepts, including aggregation, index control, sharding, replication, and clustering. We’ll show you the monitoring and administration modules available in Elasticsearch, and will also cover backup and recovery. You will learn how to scale your Elasticsearch cluster to fit your use case and improve its performance. We’ll also show you how to create your own analysis plugin in Elasticsearch. By the end of the book, you will have the knowledge necessary to master Elasticsearch and put it to efficient use.
Table of Contents (20 chapters)
Mastering Elasticsearch 5.x - Third Edition
Credits
About the Author
Acknowledgements
About the Reviewer
www.PacktPub.com
Customer Feedback
Preface

Summary


In this chapter, we started with the different Apache Lucene scoring algorithms and learned how to alter them and how to choose the right one. Then we went through the store module of Elasticsearch and learned about the different directory implementations for indices. We also covered near real-time search and indexing, and learned about transaction log configuration. Then, we looked at how segment merging works and the ways in which the merge process can be controlled. Finally, we discussed caching in Elasticsearch and the role of circuit breakers.

The next chapter is both important and interesting, as it focuses on Elasticsearch administration concepts. We will discuss the Elasticsearch discovery modules, including Amazon EC2 discovery, along with the recovery module, and show how to configure them as needed. We will also cover the major changes made to Elasticsearch monitoring and then discuss the CAT API in detail...