Index
A
- aggregations
- about / Aggregations
- bucketing / Aggregations
- metrics / Aggregations
- queries, executing / Executing the aggregation queries
- terms aggregation / The terms aggregation
- histogram aggregation / Histograms
- range aggregation / The range aggregation
- geo distance aggregation / The geo distance
- subaggregations / Sub-aggregations
- Analyzer
- about / Analyzers
- Apache Storm
- setting up / Setting up Apache Storm
- area chart
- creating / The area chart
- authentication configurations
- es.net.http.auth.user / es.net.http.auth.user
- es.net.http.auth.pass / es.net.http.auth.pass
B
- basic configurations
- es.resource / es.resource
- es.resource.read / es.resource.read
- es.resource.write / es.resource.write
- es.nodes / es.nodes
- es.port / es.port
- bolt
- boolean query
- about / The boolean query
C
- cascading
- about / Cascading with Elasticsearch
- data, importing to Elasticsearch / Importing data to Elasticsearch
- job, writing / Writing a cascading job
- job, running / Running the job
- data, reading from Elasticsearch / Reading data from Elasticsearch
- reader job, writing / Writing a reader job
- Lingual, using / Using Lingual with Elasticsearch
- certificate
- setting up / Installing SSH and setting up the certificate
- classifications
- performing / Classifying tweets using percolators
- percolator, using / Percolator
- percolator query, building / Building a percolator query effectively
- tweets, classifying / Classifying tweets
- client node
- about / The client node
- cluster, production environment
- setting up / Setting up the cluster
- recommended cluster topology / The recommended cluster topology
- naming / Set names
- paths, configuring / Paths
- memory configurations / Memory configurations
- split-brain problem / The split-brain problem
- recovery configurations / Recovery configurations
- clusters
- about / Elasticsearch clusters and nodes
- administration / Administration of clusters
- cluster health, monitoring / Monitoring the cluster health
- snapshot / Snapshot and restore
- restoring / Snapshot and restore
- CRUD operations
- performing / CRUD with Elasticsearch
- document request, creating / Creating the document request
D
- data
- discovering / Discovering data
- visualizing / Visualizing the data
- dynamic dashboards, creating / Creating dynamic dashboards
- data, cluster
- about / Data inside clusters
- shards / Shards
- replica / Replicas
- data colocation
- about / Data colocation
- data node
- about / The data node
- DataNode
- dedicated user
- setting up / Setting up a dedicated user
- distributed environment, Elasticsearch
- about / Elasticsearch in a distributed environment
- clusters / Elasticsearch clusters and nodes
- nodes / Elasticsearch clusters and nodes
- data, inside cluster / Data inside clusters
- document
- about / Document
- document request
- creating / Creating the document request
- GET request / The GET request
- Update request / The Update request
- Delete request / The Delete request
- index, creating / Creating the index
- dynamic dashboards
- creating / Creating dynamic dashboards
- migrating / Migrating the dashboards
- dynamic parallelism
- about / Dynamic parallelism
- data, reading from Elasticsearch / Reads from Elasticsearch
- failure handling / Failure handling
E
- Elasticsearch
- Hadoop, setting up / Setting up Hadoop for Elasticsearch
- setting up / Setting up Elasticsearch
- downloading / Downloading Elasticsearch
- configuring / Configuring Elasticsearch
- Head plugin, installing / Installing Elasticsearch's Head plugin
- Marvel plugin, installing / Installing the Marvel plugin
- running / Running and testing
- testing / Running and testing
- data, importing to HDFS / Getting data from Elasticsearch to HDFS
- data, reading with Pig / Reading data from Elasticsearch
- with Hive / SQLizing Elasticsearch with Hive
- ES-Hadoop
- architecture / The ES-Hadoop architecture
- dynamic parallelism / Dynamic parallelism
- data colocation / Data colocation
- on Yet Another Resource Manager (YARN) / ES-Hadoop on YARN
- basic configurations / Basic configurations
- write and query configurations / Write and query configurations
- mapping configurations / Mapping configurations
- index configurations / Index configurations
- network configurations / Network configurations
- authentication configurations / Authentication configurations
- SSL configurations / SSL configurations
- proxy configurations / Proxy configurations
- exists filter
- about / The exists filter
F
- field
- about / Field
- filters
- about / Filters
- exists filter / The exists filter
- geo distance filter / The geo distance filter
- format distributed filesystem
G
- geo distance aggregation
- about / The geo distance
- geo distance filter
- about / The geo distance filter
- geographical chart
- creating / The geographical chart
H
- Hadoop
- setting up, for Elasticsearch / Setting up Hadoop for Elasticsearch
- Java, setting up / Setting up Java
- dedicated user, setting up / Setting up a dedicated user
- SSH, installing / Installing SSH and setting up the certificate
- certificate, setting up / Installing SSH and setting up the certificate
- downloading / Downloading Hadoop
- environment variables, setting up / Setting up environment variables
- configuring / Configuring Hadoop
- core-site.xml, configuring / Configuring core-site.xml
- hdfs-site.xml, configuring / Configuring hdfs-site.xml
- yarn-site.xml, configuring / Configuring yarn-site.xml
- mapred-site.xml, configuring / Configuring mapred-site.xml
- format distributed filesystem / The format distributed filesystem
- starting / Starting Hadoop daemons
- hardware, production environment
- HDFS
- data, importing from Elasticsearch / Getting data from Elasticsearch to HDFS
- Twitter dataset / Understanding the Twitter dataset
- Twitter dataset, loading / Trying it yourself
- MapReduce job, creating / Creating the MapReduce job to import data from Elasticsearch to HDFS
- Head plugin
- installing / Installing Elasticsearch's Head plugin
- data, exploring / Exploring data in Head and Marvel
- data, viewing / Viewing data in Head
- histogram aggregation
- about / Histograms
- Hive
- setting up / Setting up Apache Hive
- data, importing to Elasticsearch / Importing data to Elasticsearch
- data, writing from JSON source / Writing from the JSON source
- type conversions / Type conversions
- data, reading from Elasticsearch / Reading data from Elasticsearch
I
- index
- about / Index
- process, controlling / Controlling the indexing process
- inverted index / What is an inverted index?
- index configurations
- es.index.auto.create / es.index.auto.create
- es.index.read.missing.as.empty / es.index.read.missing.as.empty
- es.field.read.empty.as.null / es.field.read.empty.as.null
- es.field.read.validate.presence / es.field.read.validate.presence
- input data analysis, inverted index
- stop words, removing / Removing stop words
- case insensitive / Case insensitive
- stemming / Stemming
- synonyms / Synonyms
- analyzers / Analyzers
- inverted index
- about / What is an inverted index?
- input data analysis / The input data analysis
J
- Java
- setting up / Setting up Java
K
- Kibana
- setting up / Setting up Kibana
- datasets, setting up / Setting up datasets
- configuring / Getting started with Kibana
L
- Lingual
- using / Using Lingual with Elasticsearch
M
- mapping configurations
- es.mapping.id / es.mapping.id
- es.mapping.parent / es.mapping.parent
- es.mapping.version / es.mapping.version
- es.mapping.version.type / es.mapping.version.type
- es.mapping.routing / es.mapping.routing
- es.mapping.ttl / es.mapping.ttl
- es.mapping.timestamp / es.mapping.timestamp
- es.mapping.date.rich / es.mapping.date.rich
- es.mapping.include / es.mapping.include
- es.mapping.exclude / es.mapping.exclude
- mappings
- about / Mappings
- data types / Data types
- mapping API, creating / Create mapping API
- index templates / Index templates
- MapReduce job
- creating / Creating the MapReduce job to import data from Elasticsearch to HDFS
- Tweets2HdfsMapper class, writing / Writing the Tweets2Hdfs mapper
- running / Running the example
- testing / Testing the job execution output
- Marvel plugin
- installing / Installing the Marvel plugin
- data, exploring / Exploring data in Head and Marvel
- using / Using the Marvel dashboard
- data, exploring in Sense / Exploring the data in Sense
- master node
- about / The master node
- match query
- about / The match query
- match_all query
- about / Matching all queries
- multicast discovery
- about / Multicast discovery
N
- NameNode
- network configurations
- es.nodes.discovery / es.nodes.discovery
- es.nodes.client.only / es.nodes.client.only
- es.http.timeout / es.http.timeout
- es.http.retries / es.http.retries
- es.scroll.keepalive / es.scroll.keepalive
- es.scroll.size / es.scroll.size
- es.action.heart.beat.lead / es.action.heart.beat.lead
- network logs mapper job
- writing / Writing the NetworkLogsMapper job
- mapper class, writing / Writing the mapper class
- Driver class, writing / Writing Driver
- building / Building the job
- data, obtaining into HDFS / Getting the data into HDFS
- running / Running the job
- Top N results, viewing / Viewing the Top N results
- network monitoring data
- about / Going real — network monitoring data
- obtaining / Getting and understanding the data
- issues, identifying / Knowing the problems
- solution approaches / Solution approaches
- results, preaggregating / Approach 1 – Preaggregate the results
- results, aggregating at query-time / Approach 2 – Aggregate the results at query-time
- node discovery
- about / Node discovery
- multicast discovery / Multicast discovery
- unicast discovery / Unicast discovery
- nodes
- about / Elasticsearch clusters and nodes
- types / Node types
- master node / The master node
- data node / The data node
- client node / The client node
- tribe node / Tribe nodes
P
- paradigm mismatch
- about / The paradigm mismatch
- percolator
- using / Percolator
- query, building / Building a percolator query effectively
- pie chart
- creating / The pie chart
- Pig
- about / Pigging out Elasticsearch
- setting up / Setting up Apache Pig for Elasticsearch
- data, importing to Elasticsearch / Importing data to Elasticsearch
- JSON source, writing from / Writing from the JSON source
- type conversions / Type conversions
- data, reading from Elasticsearch / Reading data from Elasticsearch
- production environment
- configuring / Configuring the environment for production
- hardware / Hardware
- configuration presets / Configuration presets
- deployment checklist / Bonus – the production deployment checklist
- production environment, configuration presets
- indexing process, optimizing / Rapid indexing
- full text search, lightening / Lightening a full text search
- aggregations / Faster aggregations
- proxy configurations
- es.net.proxy.http.host / es.net.proxy.http.host
- es.net.proxy.http.port / es.net.proxy.http.port
- es.net.proxy.http.user / es.net.proxy.http.user
- es.net.proxy.http.pass / es.net.proxy.http.pass
- es.net.proxy.http.use.system.props / es.net.proxy.http.use.system.props
- es.net.proxy.socks.host / es.net.proxy.socks.host
- es.net.proxy.socks.port / es.net.proxy.socks.port
- es.net.proxy.socks.user / es.net.proxy.socks.user
- es.net.proxy.socks.pass / es.net.proxy.socks.pass
- es.net.proxy.socks.use.system.props / es.net.proxy.socks.use.system.props
Q
- queries
- using / Elastic searching
- search queries, writing / Writing search queries
R
- range aggregation
- about / The range aggregation
- range query
- about / The range query
- RDD (resilient-distributed datasets)
- about / Giving Spark to Elasticsearch
- replica
- about / Replicas
S
- Search
- about / Knowing Search and Elasticsearch
- Unstructured search / Knowing Search and Elasticsearch
- Structured search / Knowing Search and Elasticsearch
- Geo Spatial search / Knowing Search and Elasticsearch
- Analytics / Knowing Search and Elasticsearch
- Combined / Knowing Search and Elasticsearch
- search queries
- writing / Writing search queries
- URI search / The URI search
- match_all query / Matching all queries
- term query / The term query
- boolean query / The boolean query
- match query / The match query
- range query / The range query
- wildcard query / The wildcard query
- filters / Filters
- Sense
- data, exploring / Exploring the data in Sense
- shards
- about / Shards
- allocation / Shard allocation
- snapshot
- data, backing up / Backing up your data
- data, restoring / Restoring your data
- Spark
- about / Giving Spark to Elasticsearch
- setting up / Setting up Spark
- data, importing to Elasticsearch / Importing data to Elasticsearch
- data, importing with SparkSQL / Using SparkSQL
- data, reading from Elasticsearch / Reading data from Elasticsearch
- data, reading with SparkSQL / Using SparkSQL
- split pie chart
- creating / The split pie chart
- spout
- SSH
- installing / Installing SSH and setting up the certificate
- SSL configurations
- es.net.ssl / es.net.ssl
- es.net.ssl.keystore.location / es.net.ssl.keystore.location
- es.net.ssl.keystore.pass / es.net.ssl.keystore.pass
- es.net.ssl.keystore.type / es.net.ssl.keystore.type
- es.net.ssl.truststore.location / es.net.ssl.truststore.location
- es.net.ssl.truststore.pass / es.net.ssl.truststore.pass
- es.net.ssl.cert.allow.self.signed / es.net.ssl.cert.allow.self.signed
- es.net.ssl.protocol / es.net.ssl.protocol
- es.scroll.size / es.scroll.size
- stacked bar char
- stacked bar chart
- creating / The stacked bar chart
- Storm
- streaming data, injecting / Injecting streaming data into Storm
- spout, writing / Writing a Storm spout
- bolts, writing / Writing Storm bolts
- topology, creating / Creating a Storm topology
- job, running / Building and running a Storm job
- Storm Spout
- about / What are we trying to do?
- stream
- streaming data
- injecting, into Storm / Injecting streaming data into Storm
- subaggregations
- about / Sub-aggregations
- sun burst chart
- creating / The sun burst chart
T
- term query
- about / The term query
- terms aggregation
- about / The terms aggregation
- Tokenizer filter
- about / Analyzers
- topology
- tribe node
- about / Tribe nodes
- tuple
- Twitter Trend Analyzer
- about / Getting started with the Twitter Trend Analyser
- developing / What are we trying to do?
- Apache Storm, setting up / Setting up Apache Storm
- trends, analyzing / Analyzing trends
- significant terms aggregation, using / Significant terms aggregation
- trends, viewing in Kibana / Viewing trends in Kibana
- type, Elasticsearch
- about / Type
U
- UDFs (user-defined functions)
- about / Importing data to Elasticsearch
- unicast discovery
- about / Unicast discovery
- URI search
- about / The URI search
V
- visualization, data
- about / Visualizing the data
- pie chart / The pie chart
- stacked bar chart, creating / The stacked bar chart, The date histogram with the stacked bar chart
- area chart, creating / The area chart
- split pie chart, creating / The split pie chart
- sun burst chart, creating / The sun burst chart
- geographical chart, creating / The geographical chart
W
- wildcard query
- about / The wildcard query
- WordCount example
- running / Running the WordCount example
- job JAR file, building / Getting the examples and building the job JAR file
- test file, importing to HDFS / Importing the test file to HDFS
- job JAR file, running / Running our first job
- about / Understanding the WordCount program
- Mapper / Understanding Mapper
- reducer / Understanding the reducer
- driver / Understanding the driver
- old MapReduce API, using / Using the old API – org.apache.hadoop.mapred
- write and query configurations
- es.query / es.query
- es.input.json / es.input.json
- es.write.operation / es.write.operation
- es.update.script / es.update.script
- es.update.script.lang / es.update.script.lang
- es.update.script.params / es.update.script.params
- es.update.script.params.json / es.update.script.params.json
- es.batch.size.bytes / es.batch.size.bytes
- es.batch.size.entries / es.batch.size.entries
- es.batch.write.refresh / es.batch.write.refresh
- es.batch.write.retry.count / es.batch.write.retry.count
- es.batch.write.retry.wait / es.batch.write.retry.wait
- es.ser.reader.value.class / es.ser.reader.value.class
- es.ser.writer.value.class / es.ser.writer.value.class
- es.update.retry.on.conflict / es.update.retry.on.conflict
Y
- Yet Another Resource Manager (YARN)
- about / ES-Hadoop on YARN
- ES-Hadoop support / ES-Hadoop on YARN