Scaling Apache Solr

By: Hrishikesh Vijay Karambelkar

Searching Solr documents in near real time


Apache Solr indexes updated data and makes it available for search. In many deployments, the index-generation job runs during off-peak hours (late nights, weekends) to refresh the index with newer data. Until that job runs, Apache Solr cannot find documents that were added to the customer document repository in the meantime. Often, however, the requirement is to make a document searchable as soon as it is uploaded to the customer repository. Apache Solr can search such documents in near real time. Because it takes some time to generate the index for a document and make it available for search, the search is near real time rather than truly real time.

Strategies for near real-time search in Apache Solr

In Apache Solr, a commit operation is required before a document becomes available for searching. During a commit, Solr accesses the transaction logs (or update logs), picks up the identifiers, and syncs the index files...
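
As a rough illustration of how a client might ask for a commit when it sends documents, the following Python sketch builds (but does not send) an update request. It assumes a Solr instance at a hypothetical base URL and a hypothetical collection name `docs`. The `commitWithin` and `softCommit` request parameters belong to Solr's update handler but are not discussed in the text above, so treat their use here as an assumption about how near real-time indexing could be requested, not as the book's own method.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def build_update_request(base_url, collection, docs,
                         commit_within_ms=None, soft_commit=False):
    """Build a POST request that sends documents to Solr's /update handler.

    base_url and collection are illustrative placeholders; adjust them to
    the actual deployment. Nothing is sent over the network here.
    """
    params = {}
    if commit_within_ms is not None:
        # Ask Solr to commit these documents within the given time window.
        params["commitWithin"] = str(commit_within_ms)
    if soft_commit:
        # Ask for a soft commit, which makes documents searchable
        # without flushing index files to stable storage.
        params["softCommit"] = "true"

    url = f"{base_url.rstrip('/')}/{collection}/update"
    if params:
        url += "?" + urlencode(params)

    body = json.dumps(docs).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Example: request that one document becomes searchable within one second.
request = build_update_request(
    "http://localhost:8983/solr",   # assumed local Solr instance
    "docs",                         # hypothetical collection name
    [{"id": "doc-1", "title": "Quarterly report"}],
    commit_within_ms=1000,
)
```

Passing the resulting object to `urllib.request.urlopen` would submit it to a running server. Keeping construction separate from sending makes it easy to inspect the URL and payload before anything reaches the index.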