Solr can index documents written in multiple languages. In this section, we'll cover how to index multilanguage documents in Solr and how to auto-detect a document's language.
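As a rough preview of the auto-detection part, language identification in Solr is typically wired in as an update request processor chain in solrconfig.xml. The fragment below is only a sketch under a few assumptions: the language identification contrib libraries are on the core's classpath, the chain name langid is a hypothetical name chosen for illustration, and the fallback language en is an arbitrary choice. The content and language field names come from the schema used in this section.

```xml
<!-- Sketch only: the chain name "langid" is hypothetical -->
<updateRequestProcessorChain name="langid">
  <processor class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
    <!-- Field whose text is analysed to guess the language -->
    <str name="langid.fl">content</str>
    <!-- Field that receives the detected language code -->
    <str name="langid.langField">language</str>
    <!-- Assumed fallback language when detection fails -->
    <str name="langid.fallback">en</str>
    <!-- Map the detected field to a language-specific name, e.g. content_ru -->
    <bool name="langid.map">true</bool>
  </processor>
  <processor class="solr.LogUpdateProcessorFactory" />
  <processor class="solr.RunUpdateProcessorFactory" />
</updateRequestProcessorChain>
```

With langid.map enabled, a document detected as Russian would have its content indexed under content_ru, which is why the schema needs language-suffixed dynamic fields such as *_en, *_ru, and *_fr.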
Let's create a new core called languages-example. Its schema.xml will contain the following fields, which we're going to use for our example:
<fields>
  <field name="id" type="string" indexed="true" stored="true" required="true"/>
  <field name="content" type="text_general" indexed="true" stored="true" />
  <field name="text" type="text_general" multiValued="true" indexed="true" stored="false" />
  <copyField source="content" dest="text" />
  <field name="language" type="string" stored="true" indexed="true" />
  <dynamicField name="*_en" type="text_en" stored="true" indexed="true" />
  <dynamicField name="*_ru" type="text_ru" stored="true" indexed="true" />
  <dynamicField name="*_fr" type="text_fr" stored="true" indexed...