Elasticsearch Essentials

Elasticsearch Essentials

Overview of this book

With constantly evolving and growing datasets, organizations have the need to find actionable insights for their business. ElasticSearch, which is the world's most advanced search and analytics engine, brings the ability to make massive amounts of data usable in a matter of milliseconds. It not only gives you the power to build blazing fast search solutions over a massive amount of data, but can also serve as a NoSQL data store. This guide will take you on a tour to become a competent developer quickly with a solid knowledge level and understanding of the ElasticSearch core concepts. Starting from the beginning, this book will cover these core concepts, setting up ElasticSearch and various plugins, working with analyzers, and creating mappings. This book provides complete coverage of working with ElasticSearch using Python and performing CRUD operations and aggregation-based analytics, handling document relationships in the NoSQL world, working with geospatial data, and taking data backups. Finally, we’ll show you how to set up and scale ElasticSearch clusters in production environments as well as providing some best practices.

Elasticsearch Essentials

Credits

About the Author

Acknowledgments

About the Reviewer

www.PacktPub.com

Preface

Free Chapter

Getting Started with Elasticsearch

Introducing Elasticsearch

Installing and configuring Elasticsearch

Basic operations with Elasticsearch

Summary

Understanding Document Analysis and Creating Mappings

Text search

Document analysis

Elasticsearch mapping

Summary

Putting Elasticsearch into Action

CRUD operations using elasticsearch-py

CRUD operations using Java

Creating a search database

Elasticsearch Query-DSL

Understanding Query-DSL parameters

Search requests using Python

Search requests using Java

Sorting your data

Document routing

Summary

Aggregations for Analytics

Introducing the aggregation framework

Metric aggregations

Bucket aggregations

Combining search, buckets, and metrics

Memory pressure and implications

Summary

Data Looks Better on Maps: Master Geo-Spatiality

Introducing geo-spatial data

Working with geo-point data

Geo-aggregations

Geo-shapes

Summary

Document Relationships in NoSQL World

Relational data in the document-oriented NoSQL world

Working with nested objects

Parent-child relationships

Considerations for using document relationships

Summary

Different Methods of Search and Bulk Operations

Introducing search types in Elasticsearch

Cheaper bulk operations

Multi get and multi search APIs

Data pagination

Practical considerations for bulk processing

Summary

Controlling Relevancy

Introducing relevant searches

The Elasticsearch out-of-the-box tools

Controlling relevancy with custom scoring

Summary

Cluster Scaling in Production Deployments

Node types in Elasticsearch

Introducing Zen-Discovery

Node upgrades without downtime

Upgrading Elasticsearch version

Best Elasticsearch practices in production

Creating a cluster

Scaling your clusters

Summary

Backups and Security

Introducing backup and restore mechanisms

Securing Elasticsearch

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Memory pressure and implications

Aggregations are awesome! However, they bring a lot of memory pressure on Elasticsearch. They work on an in-memory data structure called fielddata, which is the biggest consumer of HEAP memory in a Elasticsearch cluster. Fielddata is not only used for aggregations, but also used for sorting and scripts. The in-memory fielddata is slow to load, as it has to read the whole inverted index and un-invert it. If the fielddata cache fills up, old data is evicted causing heap churn and bad performance (as fielddata is reloaded and evicted again.)

The more unique terms exist in the index, the more terms will be loaded into memory and the more pressure it will have. If you are using an Elasticsearch version below 2.0.0 and above 1.0.0, then you can use the doc_vlaues parameter inside the mapping while creating the index to avoid the use of fielddata using the following syntax:

PUT /index_name/_mapping/index_type
{
  "properties": {
    "field_name": {
      "type": ...

Elasticsearch Essentials

Elasticsearch Essentials

Overview of this book

Related Content you might be interested in

Current Title:

Elasticsearch Essentials

Memory pressure and implications