Book Image

Scaling Apache Solr

By : Hrishikesh Vijay Karambelkar
Book Image

Scaling Apache Solr

By: Hrishikesh Vijay Karambelkar

Overview of this book

Table of Contents (18 chapters)
Scaling Apache Solr
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Big Data search on Katta


Katta provides highly scalable, fault-tolerant information storage. It is an open source project and uses the underlying Hadoop infrastructure (to be specific, HDFS) to store its indices and provide access to it. Katta has been in the market for the last few years. Recently, development on Katta has been stalled; yet, there are many users who go with Solr-Katta-based integration for Big Data search. Some organizations customize Katta as per their needs and utilize its capabilities for highly scalable search. Katta brings Apache Hadoop and Solr together, enabling search supported by distributed MapReduce cluster. You can read more information about Katta on http://katta.sourceforge.net/.

How Katta works?

Katta can primarily be used with two different functions: first is to generate the Solr index, and the second is to run search on the Hadoop cluster. The following diagram depicts what the Katta architecture looks like:

Katta cluster has a master node called Katta master...