Transforming data when using DIH
Data that is stored in our data source is not always in a form we would like it to be indexed in our Solr cluster. For example, imagine that you want to split the first and second names into two fields during indexing because these two reside in a single column in the database and are separated by a whitespace character. Of course, we can modify our database, but in most cases this is not possible. Can we do this? Of course we can, we just need to add some more configuration details to the Data Import Handler configuration. This recipe will show you how to do this.
Getting ready
Refer to the Indexing data from a database using Data Import Handler recipe in this chapter.
How to do it...
We will reuse the data from the Indexing data from a database using Data Import Handler recipe in this chapter. So, to select users from our table, we use the following SQL query:
SELECT user_id, user_name FROM users
The response in the text client looks as follows:
| user_id | user_name...