Book Image

Elasticsearch for Hadoop

By : Vishal Shukla
Book Image

Elasticsearch for Hadoop

By: Vishal Shukla

Overview of this book

Table of Contents (15 chapters)
Elasticsearch for Hadoop
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Summary


In this chapter, we discussed how to set up Storm to run in the local environment. You learned how to analyze a real-time streaming dataset with the the Twitter Trends Analyzer example. We created the Storm spouts and bolts to get real-time tweets and processed these tweets. We also created the Storm topology to configure our spouts and bolts with ES-Hadoop's EsBolt to inject tweets into Elasticsearch. We explored Elasticsearch's significant terms aggregation query to find the trends and unusually common patterns in the indexed data. We also used percolators to help us classify the documents with stored queries.

In the next chapter, you will understand the important Elasticsearch and ES-Hadoop concepts, such as shards, replicas, data colocations, and advanced configuration options. These concepts and configurations are essential to know before getting your wonderful application into production.