Elasticsearch Indexing

Elasticsearch Indexing

By : Huseyin Akdogan

Buy this Book

Elasticsearch Indexing

By: Huseyin Akdogan

Buy this Book

Overview of this book

Beginning with an overview of the way ElasticSearch stores data, you’ll begin to extend your knowledge to tackle indexing and mapping, and learn how to configure ElasticSearch to meet your users’ needs. You’ll then find out how to use analysis and analyzers for greater intelligence in how you organize and pull up search results – to guarantee that every search query is met with the relevant results! You’ll explore the anatomy of an ElasticSearch cluster, and learn how to set up configurations that give you optimum availability as well as scalability. Once you’ve learned how these elements work, you’ll find real-world solutions to help you improve indexing performance, as well as tips and guidance on safety so you can back up and restore data. Once you’ve learned each component outlined throughout, you will be confident that you can help to deliver an improved search experience – exactly what modern users demand and expect.

Elasticsearch Indexing

Credits

About the Author

About the Reviewer

www.PacktPub.com

Preface

Free Chapter

Introduction to Efficient Indexing

Getting started

Understanding the document storage strategy

Analysis

Summary

What is an Elasticsearch Index

Nature of the Elasticsearch index

Document

Summary

Basic Concepts of Mapping

Basic concepts and definitions

Types

The relationship between mapping and relevant search results

Understanding the schema-less

Summary

Analysis and Analyzers

Introducing analysis

Process of analysis

Built-in analyzers

What's text normalization?

ICU analysis plugin

An Analyzer Pipeline

Specifying the analyzer for a field in the mapping

Summary

Anatomy of an Elasticsearch Cluster

Basic concepts

Node

Shards

Replicas

Explaining the architecture of distribution

Correctly configuring the cluster

Choosing the right amount of shards and replicas

Summary

Improving Indexing Performance

Configuration

Optimization of mapping definition

Segments and merging policies

Store module

Bulk API

Notes

Summary

Snapshot and Restore

Snapshot repository

Snapshot

Restore

How does the snapshot process works?

Summary

Improving the User Search Experience

Correction of users' spelling mistakes

Get suggestions

Improving the relevancy of search results

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Choosing the right amount of shards and replicas

If you have a limited dataset and the dataset grows by a small amount, you can use only a single primary shard with a replica. If your dataset is not limited and grows by a large amount, the optimal number of shards is dependent on the target number of nodes.

Actually, a single node can be sufficient for many simple use cases, but to reduce the fault tolerance when considering the nature of distributed architecture and to prevent data loss, you can use more than one node. So, we need to find the answer to the first question: How many nodes will work?

Even to answer this question, we need to find out the answers to a few questions. For example: Do we need to use the non-data node? If we don't need to use non-data nodes, considering the Elasticsearch shard allocation policy, we can say that a node requires at least one shard to be the data node - as well as a replica. In that case, we can follow the following formula:

Max number of data nodes ...

Elasticsearch Indexing

By : Huseyin Akdogan

Elasticsearch Indexing

By: Huseyin Akdogan

Overview of this book

Related Content you might be interested in

Current Title:

Elasticsearch Indexing

Choosing the right amount of shards and replicas