Elasticsearch for Hadoop

By: Vishal Shukla


Controlling the indexing process


Fundamentally, indexing is the process that lets you search faster. The most familiar example is the index at the back of a book: you look up a topic, find its page number, and go straight to the content you need.

Let's consider an example of a simple index for the following two sentences:

  • Elasticsearch is an awesome search engine for fast full-text search and analytics

  • Elasticsearch can help you analyze big volume data quickly

One could create an index by simply assigning an ID to each of these sentences and splitting it into tokens, as shown in the following table:

| Id | Words |
|----|-------|
| 1 | Elasticsearch, is, an, awesome, search, engine, for, fast, full-text, and, analytics |
| 2 | Elasticsearch, can, help, you, analyze, big, volume, data, quickly |
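The ID-to-words table above can be sketched in a few lines of Python. This is only a minimal illustration: the names `sentences`, `tokenize`, and `forward_index` are hypothetical, and splitting on whitespace is a simplifying assumption, not how Elasticsearch analyzes text.

```python
# The two example sentences, keyed by an assigned ID.
sentences = {
    1: "Elasticsearch is an awesome search engine for fast full-text search and analytics",
    2: "Elasticsearch can help you analyze big volume data quickly",
}


def tokenize(text):
    """Split on whitespace and keep each distinct word once, in first-seen order.

    Assumption: whitespace splitting stands in for a real analyzer.
    """
    return list(dict.fromkeys(text.split()))


# Map each document ID to the words it contains (the table above).
forward_index = {doc_id: tokenize(text) for doc_id, text in sentences.items()}

for doc_id, words in forward_index.items():
    print(doc_id, ", ".join(words))
```

With this structure, looking up `forward_index[1]` returns the words of the first sentence directly, because the ID is the key.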

This works well if you want to find the words that exist in a given document, that is, when you already have an ID in hand and are looking for the information associated...