Apache Hadoop and its big data ecosystem have exploded in popularity, and most developers are at least loosely familiar with them. Needless to say, many pieces of the Hadoop ecosystem work together to form a big data platform. It's largely an a-la-carte world in which you combine the pieces you want; each has different uses or makes different trade-offs between ease of coding and performance. What does Solr have to do with Hadoop, you may ask? Read on.
As an alternative to a standard filesystem, Solr can store its indexes in the Hadoop Distributed File System (HDFS). HDFS acts as a shared filesystem for Solr, somewhat like networked storage (for example, a SAN), but it is implemented at the application layer instead of at the OS or hardware layer. HDFS offers almost limitless growth, and you can add storage incrementally without restarting or reconfiguring the server processes that support it. HDFS provides redundancy too, although this is extra-redundant...
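To make this concrete, here is a sketch of what storing indexes on HDFS looks like in solrconfig.xml, using Solr's HdfsDirectoryFactory. The NameNode host, port, and paths below are placeholders you would replace with your own cluster's values:

```xml
<!-- Tell Solr to read and write index files on HDFS instead of the local disk.
     hdfs://namenode:8020/solr and /etc/hadoop/conf are example values. -->
<directoryFactory name="DirectoryFactory" class="solr.HdfsDirectoryFactory">
  <str name="solr.hdfs.home">hdfs://namenode:8020/solr</str>
  <str name="solr.hdfs.confdir">/etc/hadoop/conf</str>
</directoryFactory>

<!-- Use the HDFS-aware lock implementation rather than a local lock file. -->
<indexConfig>
  <lockType>hdfs</lockType>
</indexConfig>
```

The same settings can alternatively be passed as system properties when starting Solr (for example, -Dsolr.directoryFactory=HdfsDirectoryFactory), which avoids editing solrconfig.xml for each core.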