Elasticsearch Blueprints

Index

A

  • aggregation
    • about / Aggregating your results
    • terms aggregation / Terms aggregation
  • algorithmic approach
    • about / Improving the search experience using stemming
  • analysis / Data modeling in Elasticsearch
  • analyzers
    • about / Setting the analyzer
    • setting / Setting the analyzer
    • character filters / Setting the analyzer
    • tokenizers / Setting the analyzer
    • token filters / Setting the analyzer
    • custom analyzers, creating / Creating your own analyzer
    • readymade analyzers / Readymade analyzers
  • analyzing
    • about / Setting the analyzer
  • autocomplete
    • about / Autocomplete in Elasticsearch
    • finite state transducer (FST), using / How does FST help in faster autocompletes?
    • characteristics / Hotel suggester using autocomplete
    • implementing, in hotel suggester / Hotel suggester using autocomplete

B

  • Boolean operations
    • performing / Boolean operations
  • boost query
    • using / Boosting certain symptoms over others

C

  • case-insensitive search
    • enabling / A case-insensitive search
  • category filter
    • implementing / Implementing a category filter
  • character filters
    • about / Setting the analyzer, Types of character filters
    • HTML stripper / Types of character filters
    • mapping char filter / Types of character filters
  • consistent pagination
    • with scroll API / Using the scroll API for consistent pagination
  • content match
    • title match, prioritizing / Prioritizing a title match over content match
  • curator
    • about / The curator
    • used, for shard allocation / Shard allocation using curator
    • used, for closing indices / Opening and closing of indices
    • used, for opening indices / Opening and closing of indices
    • used, for optimizing indices / Optimization
  • cURL
    • about / Deploying Elasticsearch
    • URL / Deploying Elasticsearch
    • data, specifying in command line / Communicating with the Elasticsearch server
    • data, specifying in file / Communicating with the Elasticsearch server
  • custom analyzers
    • creating / Creating your own analyzer

D

  • data modeling
    • about / Data modeling in Elasticsearch
  • date range filter
    • implementing / Filter your results based on a date range
  • decay (Gauss) functionality
    • about / The nearest hotel problem
  • dictionary-based approach
    • about / Improving the search experience using stemming
  • distance range aggregation
    • about / Restaurant categorization based on distance
  • doc ID
    • about / Communicating with the Elasticsearch server
  • documents
    • searching / Searching your documents
    • searching, with match query / A match query
  • DSL (domain-specific language)
    • about / Choosing between a query and a filter

E

  • e-mail search
    • enabling, inside text / Effective e-mail or URL link search inside text
  • Ebola outbreak records
    • about / The Ebola outbreak
    • sorting / The Ebola outbreak
    • sorting, by boost match in title field over description / Boost match in the title field column over description
    • sorting, by most recent medical journals / Most recently published medical journals
    • sorting, by most recent Ebola report on healthy patients / The most recent Ebola report on healthy patients
    • sorting, by certain symptoms over others / Boosting certain symptoms over others
    • sorting, by medical journals for different interns / Random ordering of medical journals for different interns
    • sorting, by medical journals from closest place to outbreak / Medical journals from the closest place to the Ebola outbreak
    • sorting, from unhealthy places near Ebola outbreak / Medical journals from unhealthy places near the Ebola outbreak
    • sorting, by healthy people from unhealthy locations / Healthy people from unhealthy locations have Ebola symptoms
    • sorting, by order in which symptoms appeared / Relevancy based on the order in which the symptoms appeared
  • EdgeNGram-based analyzer
    • using / Substring matching
  • Elasticsearch
    • deploying / Deploying Elasticsearch
    • about / Deploying Elasticsearch
    • URL / Deploying Elasticsearch
    • features / Communicating with the Elasticsearch server
  • Elasticsearch server
    • communicating with / Communicating with the Elasticsearch server
    • replicas / Shards and replicas
    • shards / Shards and replicas
    • index-type mapping / Index-type mapping
  • explain flag
    • about / How to debug scoring

F

  • filter
    • and query, selecting between / Choosing between a query and a filter
    • date range filter, implementing / Filter your results based on a date range
    • prize range filter, implementing / Implementing a prize range filter
    • category filter, implementing / Implementing a category filter
    • implementing, in Elasticsearch / Implementation of filters in Elasticsearch
  • filters
    • usage / Choosing between a query and a filter
  • finite state transducer (FST)
    • about / Autocomplete in Elasticsearch
    • using / How does FST help in faster autocompletes?
  • flight ticket analytics
    • scenario / A flight ticket analytics scenario
    • index creation / Index creation and mapping
    • mapping creation / Index creation and mapping
    • requisites / A case study on analytics requirements
    • male and female distribution, of passengers / Male and female distribution of passengers
    • ticket booking trends / Time-based patterns or trends in booking tickets
    • correlation of departure and arrival of flights / Hottest arrival and departure points
    • correlation of ticket type with time / The correlation of ticket type with time
    • distribution of travel duration / Distribution of the travel duration
    • most preferred hour, for booking tickets / The most preferred or hottest hour for booking tickets
    • most preferred weekday, for travel / The most preferred or hottest weekday for travel
    • correlation, checking for passenger's purpose of visit / The pattern between a passenger's purpose of visit, ticket type, and their sex
    • correlation, checking for passenger's ticket type / The pattern between a passenger's purpose of visit, ticket type, and their sex
    • correlation, checking for passenger's sex / The pattern between a passenger's purpose of visit, ticket type, and their sex
  • function query
    • using / Most recently published medical journals, Medical journals from the closest place to the Ebola outbreak
  • function score query
    • using / Boost match in the title field column over description, Medical journals from unhealthy places near the Ebola outbreak, Healthy people from unhealthy locations have Ebola symptoms

H

  • has_child query/the has_child filter
    • about / The has_child query/the has_child filter
  • has_parent filter/the has_parent query
    • about / The has_parent filter/the has_parent query
  • head UI
    • about / The head UI explained
  • highlighting
    • using / Using the highlighting feature
  • holy box of search
    • about / The holy box of search
    • field search / The field search
    • number/date range search / The number/date range search
    • phrase search / The phrase search
    • wildcard search / The wildcard search
    • regexp search / The regexp search
  • horizontal scaling
    • about / Shards and replicas
  • hotel suggester
    • autocomplete, implementing / Hotel suggester using autocomplete
  • HTML stripper
    • about / Types of character filters
  • hunspell filter
    • about / Improving the search experience using stemming

I

  • index-type mapping
    • about / Index-type mapping
  • index creation
    • default mappings, overriding / Overriding default mapping and settings in Elasticsearch
    • default settings, overriding / Overriding default mapping and settings in Elasticsearch
    • index template, creating / Index template creation
    • template, deleting / Deleting a template
    • template, retrieving / The GET template
    • multiple templates, matching / Multiple matching of templates
    • default settings, overriding for all indices / Overriding default settings for all indices
    • mapping, overriding for all indices / Overriding mapping of all types under an index
    • default field settings, overriding / Overriding default field settings
    • multiple indices, restoring / Restoring multiple indices

K

  • keyword tokenizer
    • using / Setting the field as not_analyzed
    • about / Using a lowercased analyzer

L

  • length token filter
    • about / Types of token filters
  • lowercased analyzer
    • using / Using a lowercased analyzer
  • lowercase tokenizer
    • about / Types of tokenizers
  • Lucene
    • about / Terms aggregation giving weird results

M

  • mapping char filter
    • about / Types of character filters
  • match query
    • used, for searching documents / A match query
    • multifield match query / Multifield match query
    • using / Boost match in the title field column over description
  • metaphone analyzer
    • about / Words with similar sounds
  • multiple search
    • implementing / Searching with multiple conditions

N

  • nested types
    • using / Nested types to the rescue
    • limitations / Limitations on a query on nested fields
  • news search portal
    • about / News search
    • creating / News search

O

  • older indices
    • closing / Closing older indices
    • snapshot creation / Snapshot creation and restoration of indices
    • restoration / Snapshot creation and restoration of indices
    • repository, creating / Repository creation

P

  • pagination
    • about / Pagination
    • head UI / The head UI explained
  • parent-child approach
    • using / Using a parent-child approach
    • has_parent filter/the has_parent query / The has_parent filter/the has_parent query
    • has_child query/the has_child filter / The has_child query/the has_child filter
    • top_children query / The top_children query
  • phonetic analyzer
    • used, for mapping similar sound words / Words with similar sounds
    • URL / Words with similar sounds
    • installing / Words with similar sounds
  • phrase query
    • used, for searching / Using phrase query to search
    • reference link / Using phrase query to search
  • prize range filter
    • implementing / Implementing a prize range filter
  • product-with-tags
    • searching / The product-with-tags search problem

Q

  • query
    • and filter, selecting between / Choosing between a query and a filter
  • question and answer forum
    • schema, designing / Schema design to store questions and answers
    • questions, searching based on criteria of answers / Searching questions based on a criteria of answers
    • answers, searching based on criteria of questions / Searching answers based on a criteria of questions
    • score of question, aggregating / The score of questions based on the score of each answer
    • questions, filtering with more than four answers / Filtering questions with more than four answers
    • best questions, displaying / Displaying the best questions and their accepted answers
    • accepted answer, displaying / Displaying the best questions and their accepted answers

R

  • RAID disks
    • about / Shards and replicas
  • random function
    • using / Random ordering of medical journals for different interns
  • readymade analyzers
    • about / Readymade analyzers
  • replicas
    • about / Shards and replicas
  • restaurant search
    • creating / Restaurant search
    • requisites / Restaurant search
    • data modeling, for restaurants / Data modeling for restaurants
    • nearest restaurants, searching / The nearest hotel problem
    • within maximum distance / The maximum distance covered
    • inside particular city limit / Inside the city limits
    • distance, searching between user location and each restaurant / Distance values between the current point and each restaurant
    • out of city limits / Restaurants out of city limits
    • restaurant categorization, based on distance / Restaurant categorization based on distance
    • restaurant categorization, based on nearness / Aggregating restaurants based on their nearness

S

  • scaling
    • about / Shards and replicas
    • vertical scaling / Shards and replicas
    • horizontal scaling / Shards and replicas
  • scoring
    • about / How scoring works
    • debugging / How to debug scoring
  • scoring, factors
    • term frequency / How scoring works
    • inverse document frequency (IDF) / How scoring works
    • field length / How scoring works
  • scroll API
    • used, for consistent pagination / Using the scroll API for consistent pagination
  • shards
    • about / Shards and replicas
    • filtering / Shard filtering
    • allocating, curator used / Shard allocation using curator
  • shingle tokenizer
    • about / Types of tokenizers
  • simple query string query
    • about / Boolean operations
  • snapshots
    • creating / Snapshot creation
    • creating, on specific indices / Snapshot creation on specific indices
    • restoring / Restoring a snapshot
  • snowball algorithm
    • about / Improving the search experience using stemming
  • Snowball Analyzer
    • about / Readymade analyzers
  • sort
    • implementing / Sorting results
  • Stack Overflow
    • about / Schema design to store questions and answers
  • stemmer token filter
    • about / Types of token filters
  • stemming
    • used, for improving search experience / Improving the search experience using stemming
    • algorithmic approach / Improving the search experience using stemming
    • dictionary-based approach / Improving the search experience using stemming
  • stop words
    • about / Setting the analyzer
  • stop word token filter
    • about / Types of token filters
  • substring matching
    • with EdgeNGram-based analyzer / Substring matching
  • synonym-aware search
    • about / A synonym-aware search
    • implementing / A synonym-aware search

T

  • terms aggregation
    • about / Terms aggregation, Terms aggregation giving weird results
    • fields, setting / Setting the field as not_analyzed
  • time-based data
    • searching / Searching for time-based data
    • archiving / Archiving time-based data
    • shards, filtering / Shard filtering
    • optimized API, running / Running the optimized API on indices where writing is done
  • title match
    • prioritizing, over content match / Prioritizing a title match over content match
  • token filters
    • about / Setting the analyzer
    • stop word token filter / Types of token filters
    • length token filter / Types of token filters
    • stemmer token filter / Types of token filters
  • tokenizers
    • about / Setting the analyzer
    • whitespace tokenizer / Types of tokenizers
    • shingle tokenizer / Types of tokenizers
    • lowercase tokenizer / Types of tokenizers
  • tokens
    • about / Index-type mapping
  • top_children query
    • about / The top_children query

U

  • UAX Email URL tokenizer
    • about / Effective e-mail or URL link search inside text
  • URL link search
    • enabling, inside text / Effective e-mail or URL link search inside text

V

  • vertical scaling
    • about / Shards and replicas

W

  • whitespace tokenizer
    • about / Types of tokenizers

Z

  • Zomato
    • about / Hotel suggester using autocomplete