We explored text tagging with the help of Lucene and Solr in this chapter. We understood what FSTs
are and how they are implemented in Lucene. We also went through some well-known text tagging algorithms and got a brief idea of how text tagging is implemented in Solr. We explored the SolrTextTagger
package by installing it as a module in Solr and saw some examples of text tagging using this package.
This is the last chapter in this book. In our journey throughout this book, we went through Solr indexing internals where we saw the roles of analyzers
and tokenizers
in index creation. We also saw multi-lingual search and discussed the challenges in large-scale indexing and the solutions to these problems. We then saw how Solr's scoring algorithm can be tweaked and customized. We discussed some existing algorithms and concept scoring algorithms. In the next chapter, we explored Solr internals and learnt how the relevancy scoring algorithm works on the inverted index. We delved into the...