Mastering Apache Solr 7.x

By : Sandeep Nair, Chintan Mehta, Dharmesh Vasoya

Overview of this book

Apache Solr is a standalone enterprise search server with a REST-like application interface, providing highly scalable, distributed search and index replication for many of the world's largest internet sites. To begin with, you will be introduced to full-text search, multiple-filter search, dynamic clustering, and more, helping you brush up on the basics of Apache Solr. You will also explore the new features and advanced options released in Apache Solr 7.x, which address numerous performance aspects and make data investigation simpler, easier, and more powerful. You will learn to build complex queries and extensive filters, and see how they are compiled in your system to bring relevance to your search tools. You will learn how Solr scoring works, which elements affect a document's score, and how you can optimize or tune the score for the application at hand. You will learn to extract features of documents and to write complex queries that re-rank them. You will also learn advanced options that help you understand what content is indexed and how the extracted content is indexed. Throughout the book, you will work through complex problems and their solutions, along with varied approaches to tackling your business needs. By the end of this book, you will have the advanced proficiency needed to build out-of-the-box smart search solutions for your enterprise demands.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Introduction to Solr 7

Introduction to Solr

Why choose Solr?

Solr use cases

What's new in Solr 7?

Summary

Getting Started

Solr installation

Understanding various files and the folder structure

Running Solr

Loading sample data

Understanding the browse interface

Using the Solr admin interface

Summary

Designing Schemas

How Solr works

Understanding field types

Field management

Mastering Schema API

Deciphering schemaless mode

Summary

Mastering Text Analysis Methodologies

Understanding text analysis

Understanding analyzer

Understanding tokenizers

Understanding filters

Understanding multilingual analysis

Understanding phonetic matching

Summary

Data Indexing and Operations

Basics of Solr indexing

Understanding index handlers

Apache Tika and indexing

Language detection

Client APIs

Summary

Advanced Queries – Part I

Search relevance

Velocity search UI

Query parsing and syntax

Response writer

Faceting

Highlighting

Summary

Advanced Queries – Part II

Spellchecking

Suggester

Pagination

Result grouping

Result clustering

Spatial search

Summary

Managing and Fine-Tuning Solr

JVM configuration

Managing solrconfig.xml

Managing backups

JMX with Solr

Logging configuration

SolrCloud overview

Enabling SSL – Solr security

Performance statistics

Summary

Client APIs – An Overview

Client API overview

JavaScript Client API

SolrJ Client API

Ruby Client API

Python Client API

Summary

Introduction to Solr 7

Today we are in the age of digitization. People are generating data in different ways: they take pictures, upload images, write blogs, comment on someone's blog or picture, change their status on social networking sites, tweet on Twitter, update details on LinkedIn, make financial transactions, write emails, store data on the cloud, and so on. Data volume has grown not only in the personal space but also in professional services, where people have to deal with an enormous amount of data. Think of the data managed by players such as Google, Facebook, the New York Stock Exchange, Amazon, and many others. To cope with this data tsunami, we need appropriate tools to fetch data in an organized way, so that it can be used in various fields, such as scientific research, real-time traffic, fighting crime, fraud detection, digital personalization, and so on. All of this data needs to be captured, stored, searched, shared, transferred, analyzed, and visualized.

Analyzing structured, unstructured, or semi-structured ubiquitous data helps us discover hidden patterns, market trends, correlations, and personal preferences. With the right tools to process and analyze data, organizations can expect much better marketing plans, additional revenue opportunities, improved customer service, better operational efficiency, competitive benefits, and much more. It is important not only to store data but also to process it in order to generate the information that is needed. Every company collects data and uses it; however, to flourish, a company needs to be able to search for the relevant data. Every company must extract the data that search produces directly, which can improve its business either directly or indirectly.

Okay, now you have Solr, which is generally referred to as a search server, and you are doing searches. Is that all you need? Hold on! Solr allows a lot more than a simple search. So get ready to take a deep dive into Solr, a scalable, flexible, enterprise NoSQL search platform!

We will go through the following topics in this chapter:

Introduction to Solr
Why Solr?
Solr use cases
What's new in Solr 7
