Mastering Elasticsearch 5.x - Third Edition

Overview of this book

Elasticsearch is a modern, fast, distributed, scalable, fault-tolerant, open source search and analytics engine. It leverages the capabilities of Apache Lucene and provides a new level of control over how you index and search even huge sets of data. This book gives you a brief recap of the basics and introduces the new features of Elasticsearch 5. We will guide you through the intermediate and advanced functionalities of Elasticsearch, such as querying, indexing, searching, and modifying data. We’ll also explore advanced concepts, including aggregation, index control, sharding, replication, and clustering. We’ll show you the monitoring and administration modules available in Elasticsearch, and will also cover backup and recovery. You will learn how to scale your Elasticsearch cluster to fit your use case and improve its performance. We’ll also show you how to create your own analysis plugin in Elasticsearch. By the end of the book, you will have all the knowledge necessary to master Elasticsearch and put it to efficient use.

Chapter 9. Data Transformation and Federated Search

In the last chapter, we covered most of the topics related to Elasticsearch cluster administration. We started by describing the different types of nodes and how to configure them, and then looked at the Elasticsearch discovery and recovery modules in detail. Next, we covered the cat API of Elasticsearch, which is very useful for viewing node, cluster, and index stats in a readable format. Finally, we discussed the snapshot and restore APIs, which allow you to perform incremental backups to various repositories, such as shared file systems or the cloud, and to restore the snapshots back into the cluster.

In this chapter, we will cover one of the most exciting features introduced in Elasticsearch: ingest nodes. These allow us to preprocess data within the Elasticsearch cluster itself before indexing. We will also look into how federated search works across different clusters using tribe nodes. By the end of this chapter, we will have covered the following...