-
Book Overview & Buying
-
Table Of Contents
Turning Text into Gold: Taxonomies and Textual Analytics
By :
Basic Refinements
In the previous example, it would be wise to include the specific word preceded by a blank space in the search logic. For example, “ ford” instead of “ford”. Including the blank space would avoid retrieving the word “afford”.
Also note that the raw text is always searched as a lower case word. And any and all punctuations are removed. The raw text is searched as lower case in order to not misidentify a hit. For example, in the sentence “My Porsche runs fast.” the word “Porsche” needs to be identified as “Porsche”. Doing searches on a single case makes searching much more efficient. It has been observed that approximately 75% of raw text is served by these few basic refinements to taxonomy processing. The remaining 25% of text requires other techniques.
Change the font size
Change margin width
Change background colour