The common way of data modeling for relational database is to reduce data redundancy and improve data integrity by using a normalization process. However, Elasticsearch is not ;a relational database and, as such, data modeling does not apply to it. Like most NoSQL databases, Elasticsearch treats the real world as a flat world, which means each document is independent. Basically, a single document should include all of the information used to determine whether it matches a search request. To bridge the gap between two worlds, several common techniques of data modeling are used in Elasticsearch. We'll discuss data denormalization, inner objects, nested objects, and join datatypes in the following sub-sections. No matter what kinds of technique you use, if you don't know the query that the user is running, you can't make good data modeling...
Advanced Elasticsearch 7.0
By :
Advanced Elasticsearch 7.0
By:
Overview of this book
Building enterprise-grade distributed applications and executing systematic search operations call for a strong understanding of Elasticsearch and expertise in using its core APIs and latest features. This book will help you master the advanced functionalities of Elasticsearch and understand how you can develop a sophisticated, real-time search engine confidently. In addition to this, you'll also learn to run machine learning jobs in Elasticsearch to speed up routine tasks.
You'll get started by learning to use Elasticsearch features on Hadoop and Spark and make search results faster, thereby improving the speed of query results and enhancing the customer experience. You'll then get up to speed with performing analytics by building a metrics pipeline, defining queries, and using Kibana for intuitive visualizations that help provide decision-makers with better insights. The book will later guide you through using Logstash with examples to collect, parse, and enrich logs before indexing them in Elasticsearch.
By the end of this book, you will have comprehensive knowledge of advanced topics such as Apache Spark support, machine learning using Elasticsearch and scikit-learn, and real-time analytics, along with the expertise you need to increase business productivity, perform analytics, and get the very best out of Elasticsearch.
Table of Contents (25 chapters)
Preface
Overview of Elasticsearch 7
Index APIs
Document APIs
Mapping APIs
Anatomy of an Analyzer
Search APIs
Section 2: Data Modeling, Aggregations Framework, Pipeline, and Data Analytics
Modeling Your Data in the Real World
Aggregation Frameworks
Preprocessing Documents in Ingest Pipelines
Using Elasticsearch for Exploratory Data Analysis
Section 3: Programming with the Elasticsearch Client
Elasticsearch from Java Programming
Elasticsearch from Python Programming
Section 4: Elastic Stack
Using Kibana, Logstash, and Beats
Working with Elasticsearch SQL
Working with Elasticsearch Analysis Plugins
Section 5: Advanced Features
Machine Learning with Elasticsearch
Spark and Elasticsearch for Real-Time Analytics
Building Analytics RESTful Services
Other Books You May Enjoy
Customer Reviews