Book Image

Scaling Apache Solr

By : Hrishikesh Vijay Karambelkar
Book Image

Scaling Apache Solr

By: Hrishikesh Vijay Karambelkar

Overview of this book

Table of Contents (18 chapters)
Scaling Apache Solr
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Enabling search result clustering with Carrot2


Carrot2 is an application suite to provide document cluster on top of your dataset. It analyzes a set of documents and classifies them into multiple groups based on similarity. The similarities between the documents on various aspects can be used to cluster the documents in different groups. Cluster analysis is very useful, because it provides automatic categorization of your information so that users can browse a flat information model hierarchically.

Note

Cluster analysis techniques are useful in organization for statistical analysis. For example, a cluster analysis over the customer database enables an organization to classify the customers based on different patterns and plan the future strategies to improve upon the sales. Such analysis can be done using different algorithms research done by many statisticians. Among the famous ones are partitioning and hierarchy based.

Carrot2 is BSD licensed, and it can be utilized with any search engine...