Book Image

Administrating Solr

By : Surendra Mohan
Book Image

Administrating Solr

By: Surendra Mohan

Overview of this book

Implementing different search engines on web products is a mandate these days. Apache Solr is a robust search engine, but simply implementing Apache Solr and forgetting about it is not a good idea, especially when you have to fight for the search ranking of your web product. In such a scenario, you need to keep monitoring, administrating, and optimizing your Solr to retain your ranking. "Administrating Solr" is a practical, hands-on guide. This book will provide you with a number of clear, step-by-step exercises and some advanced concepts which will help you administrate, monitor, and optimize Solr using Drupal and associated scripts. Administrating Solr will also provide you with a solid grounding on how you can use Apache Solr with Drupal. "Administrating Solr" starts with an overview of Apache Solr and the installation process to get you familiar with Solr. It then gradually moves on to discuss the mysteries that make Solr flexible enough to render appropriate search results in different scenarios. This book will take you through clear and practical concepts that will help you monitor, administrate, and optimize your Solr appropriately using both scripts and tools. This book will also teach you ways to query your search and methods to keep your Solr healthy and well maintained. With this book, you will learn how to effectively implement and optimize Solr using Drupal.
Table of Contents (12 chapters)

Chapter 4. Optimizing Solr Tools and Scripts

In the previous chapter, we learned about Solr scripts such as scripts.conf and init script, writing scripts to take Solr backups and to configure Solr logs, and collection distribution scripts.

In this chapter, we will learn how to optimize Solr tools and scripts:

  • Business rules

  • Language detection

  • OpenNLP(Natural Language Processing)

  • Solr operation tool implementation with Drupal 7

We will learn what business rules are, when, where, and how to use it and how to write your custom rule using Drools; what is language detection, comparative study of different language detections such as CLD, LangDetect, and Tika, how to configure LangDetect and Tika; what is NLP, how and where it can be used, what is OpenNLP, how does it function and what the different phases OpenNLP consists of; and how to implement Solr operation tool using Drupal 7, and the corresponding contributed Drupal modules.

Let's get started.