Book Image

Apache Solr PHP Integration

By : Jayant Kumar
Book Image

Apache Solr PHP Integration

By: Jayant Kumar

Overview of this book

The Search tool is a very powerful for any website. No matter what type of website, the search tool helps visitors find what they are looking for using key words and narrow down the results using facets. Solr is the popular, blazing fast, open source enterprise search platform from the Apache Lucene project. It is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest websites.This book is a practical, hands-on, end-to-end guide that provides you with all the tools required to build a fully-featured search application using Apache Solr and PHP. The book contains practical examples and step-by-step instructions.Starting off with the basics of installing Apache Solr and integrating it with Php, the book then proceeds to explore the features provided by Solr to improve searches using Php. You will learn how to build and maintain a Solr index using Php, discover the query modes available with Solr, and how to use them to tune the Solr queries to retrieve relevant results. You will look at how to build and use facets in your search, how to tune and use fast result highlighting, and how to build a spell check and auto complete feature using Solr. You will finish by learning some of the advanced concepts required to runa large-scale enterprise level search infrastructure.
Table of Contents (15 chapters)
Apache Solr PHP Integration
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Solr highlighting configuration


Solr has two types of highlighters—regular highlighter and fast vector highlighter. The regular highlighter works on most query types but does not scale well to large documents. On the other hand, the fast vector highlighter scales very well to large documents but supports fewer query types. Though personally I have not come across a situation where the fast vector highlighter does not work.

Note

The fast vector highlighter requires termVectors, termPositions, and termOffsets to be set for it to work.

Let us look at the Solr configuration for highlighting. Open up the Solr configuration at <solr_directory>/example/solr/collection1/conf/solrconfig.xml. Search for an XML element searchComponent with attribute class="solr.HighlightComponent" and name="highlight". We can see that there are multiple fragmenters, an HTML formatter, and an HTML encoder defined in the file. We also have multiple fragmentsBuilders, multiple fragListBuilders and multiple boundaryScanners...