Learning Elastic Stack 6.0

Learning Elastic Stack 6.0

By : Pranav Shukla, Sharath Kumar M N

Buy this Book

Learning Elastic Stack 6.0

By: Pranav Shukla, Sharath Kumar M N

Buy this Book

Overview of this book

The Elastic Stack is a powerful combination of tools for distributed search, analytics, logging, and visualization of data from medium to massive data sets. The newly released Elastic Stack 6.0 brings new features and capabilities that empower users to find unique, actionable insights through these techniques. This book will give you a fundamental understanding of what the stack is all about, and how to use it efficiently to build powerful real-time data processing applications. After a quick overview of the newly introduced features in Elastic Stack 6.0, you’ll learn how to set up the stack by installing the tools, and see their basic configurations. Then it shows you how to use Elasticsearch for distributed searching and analytics, along with Logstash for logging, and Kibana for data visualization. It also demonstrates the creation of custom plugins using Kibana and Beats. You’ll find out about Elastic X-Pack, a useful extension for effective security and monitoring. We also provide useful tips on how to use the Elastic Cloud and deploy the Elastic Stack in production environments. On completing this book, you’ll have a solid foundational knowledge of the basic Elastic Stack functionalities. You’ll also have a good understanding of the role of each component in the stack to solve different data processing problems.

Title Page

Credits

Disclaimer

About the Authors

About the Reviewer

www.PacktPub.com

Customer Feedback

Preface

Free Chapter

Introducing Elastic Stack

What is Elasticsearch, and why use it?

Exploring the components of Elastic Stack

Use cases of Elastic Stack

Downloading and installing

Summary

Getting Started with Elasticsearch

Using the Kibana Console UI

Core concepts

CRUD operations

Creating indexes and taking control of mapping

REST API overview

Summary

Searching-What is Relevant

Basics of text analysis

Searching from structured data

Searching from full text

Writing compound queries

Summary

Analytics with Elasticsearch

The basics of aggregations

Preparing data for analysis

Metric aggregations

Bucket aggregations

Pipeline aggregations

Summary

Analyzing Log Data

Log analysis challenges

Logstash architecture

Overview of Logstash plugins

Ingest node

Summary

Building Data Pipelines with Logstash

Parsing and enriching logs using Logstash

Introducing Beats

Filebeat

Summary

Visualizing data with Kibana

Downloading and installing Kibana

Summary

Elastic X-Pack

Installing X-Pack

Configuring X-Pack

Security

Monitoring Elasticsearch

Alerting

Summary

Running Elastic Stack in Production

Hosting Elastic Stack on a managed cloud

Hosting Elastic Stack on your own

Backing up and restoring

Setting up index aliases

Setting up index templates

Modeling time series data

Summary

Building a Sensor Data Analytics Application

Introduction to the application

Modeling data in Elasticsearch

Setting up the metadata database

Building the Logstash data pipeline

Sending data to Logstash over HTTP

Visualizing the data in Kibana

Summary

Monitoring Server Infrastructure

Metricbeat

Configuring Metricbeat

Capturing system metrics

Deploymezs architecture

Summary

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Modeling time series data

Often, we have a need to store time series data in Elasticsearch. Typically, one would create a single index to hold all documents. This typical approach of one big index to hold all documents has its own limitations, especially for the following reasons:

Scaling the index with an unpredictable volume over time
Changing the mapping over time
Automatically deleting older documents

Let's look at how each problem manifests itself when we choose a single monolithic index.

Scaling the index with unpredictable volume over time

One of the most difficult choices when creating an Elasticsearch cluster and its indices is deciding how many primary shards should be created and how many replica shards should be created.

Let's understand how the number of shards becomes important in the following sub sections:

Unit of parallelism in Elasticsearch:
- The effect of the number of shards on the relevance score
- The effect of the number of shards on the accuracy of aggregations

Learning Elastic Stack 6.0

By : Pranav Shukla, Sharath Kumar M N

Learning Elastic Stack 6.0

By: Pranav Shukla, Sharath Kumar M N

Overview of this book

Related Content you might be interested in

Current Title:

Learning Elastic Stack 6.0

Mastering Elastic Stack

Mastering Kibana 6.x

Learning Elasticsearch

Modeling time series data

Scaling the index with unpredictable volume over time

Unit of parallelism...