Jul. 03, 2019
Karan Jeet Singh
|
Some of our SearchStax clients index websites that use multiple languages. We were recently asked how to enable Solr indexing of Mandarin on a cloud platform. (This post describes indexing Traditional Chinese characters. It is also possible to use Simplified Chinese by following a similar series of steps. Contact us at support@searchstax.com for an example.)
Update solrconfig.xml file by adding following line after all the lib declarations.
This library comes with Solr, so you don’t have to alter your deployment in any way to make it work.
A. Create a new field type in the managed-schema file with the SmartChineseAnalyzer.
<fieldType name="text_mandarin" class="solr.TextField"> <analyzer> <tokenizer class="solr.ICUTokenizerFactory"/> <filter class="solr.CJKWidthFilterFactory"/> <filter class="solr.LowerCaseFilterFactory"/> </analyzer> </fieldType>
B. Create a field that uses this field type.
<field name=”text_man” type=”text_mandarin” multiValued=”true” indexed=”true” stored=”true”/>
Copyrights © SearchStax Inc.2014-2024. All Rights Reserved.
close
close
close