Book Image

Scaling Apache Solr

By : Hrishikesh Vijay Karambelkar
Book Image

Scaling Apache Solr

By: Hrishikesh Vijay Karambelkar

Overview of this book

Table of Contents (18 chapters)
Scaling Apache Solr
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Chapter 3. Analyzing Data with Apache Solr

Many organizations suffer when dealing with huge amounts of data generated in different formats, due to incremental IT enablement of their business processes. Dealing with vast varieties of data becomes a challenge for any enterprise search engine. This data may reside in a database, or would be streamed over HTTP protocol. To address these problems, many companies provided tools to bring in data from various sources into one form. These were Extract Transfer Load (ETL) tools mainly used for business intelligence (BI) and analytics solutions. Luckily, Apache Solr provides different ways of dealing with different data types, when it comes down to information collection. We have already read about indexing in Chapter 1, Understanding Apache Solr. In this chapter, we are going to look at analyzing different types of data and how to deal with them. We will focus on the following topics:

  • Configuring handlers

  • Apache Tika integration

  • Importing data from...