Scaling Apache Solr

By: Hrishikesh Vijay Karambelkar

Understanding SolrCloud


SolrCloud provides a new way to enable distributed enterprise search using Apache Solr. With the standard distributed Solr support, much of the setup had to be done by hand. SolrCloud automates manual steps such as configuring solrconfig.xml to talk with shards, adding documents to the shards, and similar work. Unlike the traditional master-slave approach to distributed Solr, SolrCloud uses a leader-replica approach. SolrCloud runs on top of Apache ZooKeeper. First, let's understand ZooKeeper.
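To make the idea of automatic document distribution concrete, here is a minimal sketch of hash-based routing, in which each document ID is mapped to one shard without any manual assignment. It is an illustration only, not SolrCloud's actual routing code: the function names `shard_for` and `route`, the sample document IDs, and the use of CRC32 as the hash are all assumptions made for this example.

```python
import zlib


def shard_for(doc_id: str, num_shards: int) -> int:
    """Return the index of the shard that owns doc_id.

    CRC32 is a stand-in hash chosen for illustration; it is an
    assumption of this sketch, not the hash SolrCloud uses.
    """
    return zlib.crc32(doc_id.encode("utf-8")) % num_shards


def route(doc_ids, num_shards):
    """Group document IDs by the shard each one hashes to."""
    shards = {i: [] for i in range(num_shards)}
    for doc_id in doc_ids:
        shards[shard_for(doc_id, num_shards)].append(doc_id)
    return shards


# Hypothetical document IDs spread over a two-shard collection.
placement = route(["doc-1", "doc-2", "doc-3", "doc-4"], num_shards=2)
for shard, ids in placement.items():
    print(f"shard {shard}: {ids}")
```

Because the shard is derived from the document ID alone, any node can work out where a document belongs, which is what removes the need to add documents to specific shards by hand.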

Why ZooKeeper?

SolrCloud consists of a cluster of nodes, which talk with one another through Apache ZooKeeper. Apache ZooKeeper is responsible for maintaining coordination among the various nodes. Besides coordinating among nodes, it also maintains configuration information and provides group services to the distributed system. Due to its in-memory management of information...