Elasticsearch Indexing

By: Huseyin Akdogan

Overview of this book

Starting with an overview of the way Elasticsearch stores data, you’ll extend your knowledge to tackle indexing and mapping, and learn how to configure Elasticsearch to meet your users’ needs. You’ll then find out how to use analysis and analyzers for greater intelligence in how you organize and pull up search results, so that search queries are met with relevant results. You’ll explore the anatomy of an Elasticsearch cluster, and learn how to set up configurations that give you optimum availability as well as scalability. Once you’ve learned how these elements work, you’ll find real-world solutions to help you improve indexing performance, as well as tips and guidance on safety so you can back up and restore data. Once you’ve learned each component outlined throughout, you will be confident that you can help to deliver an improved search experience: exactly what modern users demand and expect.

Snapshot repository

When working with large amounts of data, backup and restore is an important requirement. Elasticsearch provides a snapshot and restore module for backing up and restoring existing indices. Because Elasticsearch needs to know where to store the backup data, a snapshot repository must be registered before any index can be backed up or restored. The following is an example of how to register a snapshot repository:

curl -XPUT localhost:9200/_snapshot/my_backup -d '{
    "type": "fs",
    "settings": {
        "location": "/data/backups/my_backup",
        "compress": true,
        "chunk_size": "10m"
    }
}'

The preceding command registers a shared filesystem repository named my_backup that stores its data under /data/backups/my_backup. _snapshot is the REST endpoint for snapshot operations; the path segment that follows it is the repository name, which must be unique. The type parameter...
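
As a rough sketch of how the pieces of the registration request fit together, the helpers below build the _snapshot URL and the JSON body separately, then register the repository and read its definition back. The function names (repo_url, repo_body, register_repo, show_repo) and the ES_HOST variable are hypothetical and exist only for illustration; the host, repository name, and settings mirror the example above.

```shell
#!/bin/sh
# Assumed address of a local Elasticsearch node, as in the example above.
ES_HOST="localhost:9200"

# Build the _snapshot endpoint URL for a repository name ($1).
repo_url() {
    printf '%s/_snapshot/%s' "$ES_HOST" "$1"
}

# Build the JSON body for a shared filesystem ("fs") repository.
# $1 = location on the shared filesystem, $2 = chunk size
repo_body() {
    printf '{"type":"fs","settings":{"location":"%s","compress":true,"chunk_size":"%s"}}' "$1" "$2"
}

# Register the repository; needs a running node.
# The Content-Type header is required by newer Elasticsearch releases.
register_repo() {
    curl -s -XPUT "$(repo_url "$1")" \
        -H 'Content-Type: application/json' \
        -d "$(repo_body "$2" "$3")"
}

# Read the repository definition back to confirm the registration.
show_repo() {
    curl -s -XGET "$(repo_url "$1")"
}
```

With a node running, register_repo my_backup /data/backups/my_backup 10m followed by show_repo my_backup should return the stored type and settings. On Elasticsearch 1.6 and later, the location also has to be listed under path.repo in elasticsearch.yml on every node before the registration is accepted.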