123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869 |
- [[analysis]]
- == Analysis Plugins
- Analysis plugins extend Elasticsearch by adding new analyzers, tokenizers,
- token filters, or character filters to Elasticsearch.
- [discrete]
- ==== Core analysis plugins
- The core analysis plugins are:
- <<analysis-icu,ICU>>::
- Adds extended Unicode support using the http://site.icu-project.org/[ICU]
- libraries, including better analysis of Asian languages, Unicode
- normalization, Unicode-aware case folding, collation support, and
- transliteration.
- <<analysis-kuromoji,Kuromoji>>::
- Advanced analysis of Japanese using the https://www.atilika.org/[Kuromoji analyzer].
- <<analysis-nori,Nori>>::
- Morphological analysis of Korean using the Lucene Nori analyzer.
- <<analysis-phonetic,Phonetic>>::
- Analyzes tokens into their phonetic equivalent using Soundex, Metaphone,
- Caverphone, and other codecs.
- <<analysis-smartcn,SmartCN>>::
- An analyzer for Chinese or mixed Chinese-English text. This analyzer uses
- probabilistic knowledge to find the optimal word segmentation for Simplified
- Chinese text. The text is first broken into sentences, then each sentence is
- segmented into words.
- <<analysis-stempel,Stempel>>::
- Provides high quality stemming for Polish.
- <<analysis-ukrainian,Ukrainian>>::
- Provides stemming for Ukrainian.
- [discrete]
- ==== Community contributed analysis plugins
- A number of analysis plugins have been contributed by our community:
- * https://github.com/medcl/elasticsearch-analysis-ik[IK Analysis Plugin] (by Medcl)
- * https://github.com/medcl/elasticsearch-analysis-pinyin[Pinyin Analysis Plugin] (by Medcl)
- * https://github.com/duydo/elasticsearch-analysis-vietnamese[Vietnamese Analysis Plugin] (by Duy Do)
- * https://github.com/medcl/elasticsearch-analysis-stconvert[STConvert Analysis Plugin] (by Medcl)
- include::analysis-icu.asciidoc[]
- include::analysis-kuromoji.asciidoc[]
- include::analysis-nori.asciidoc[]
- include::analysis-phonetic.asciidoc[]
- include::analysis-smartcn.asciidoc[]
- include::analysis-stempel.asciidoc[]
- include::analysis-ukrainian.asciidoc[]
|