[[analysis-smartcn]]
=== Smart Chinese Analysis Plugin

The Smart Chinese Analysis plugin integrates Lucene's Smart Chinese analysis
module into Elasticsearch.

It provides an analyzer for Chinese or mixed Chinese-English text. This
analyzer uses probabilistic knowledge to find the optimal word segmentation
for Simplified Chinese text. The text is first broken into sentences, then
each sentence is segmented into words.

[[analysis-smartcn-install]]
[float]
==== Installation

This plugin can be installed using the plugin manager:

[source,sh]
----------------------------------------------------------------
sudo bin/elasticsearch-plugin install analysis-smartcn
----------------------------------------------------------------

The plugin must be installed on every node in the cluster, and each node must
be restarted after installation.
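
One way to confirm the installation on a node is to look for the plugin name
in the plugin manager's listing. The following is a minimal sketch, not part
of the plugin: the helper name `has_smartcn` is hypothetical, and the usage
line assumes it is run from the Elasticsearch home directory.

[source,sh]
----------------------------------------------------------------
# Hypothetical helper: succeeds if the plugin listing read from stdin
# mentions analysis-smartcn.
has_smartcn() {
  grep -q 'analysis-smartcn'
}

# Usage, from the Elasticsearch home directory on each node:
#   bin/elasticsearch-plugin list | has_smartcn && echo "analysis-smartcn found"
----------------------------------------------------------------

Because the plugin has to be present on every node, the same check would need
to be repeated on each one.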

[[analysis-smartcn-remove]]
[float]
==== Removal

The plugin can be removed with the following command:

[source,sh]
----------------------------------------------------------------
sudo bin/elasticsearch-plugin remove analysis-smartcn
----------------------------------------------------------------

The node must be stopped before removing the plugin.

[[analysis-smartcn-tokenizer]]
[float]
==== `smartcn` tokenizer and token filter

The plugin provides the `smartcn` analyzer and `smartcn_tokenizer` tokenizer,
which are not configurable.

NOTE: The `smartcn_word` token filter and `smartcn_sentence` have been deprecated.
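
Since neither component takes parameters, using them amounts to referring to
them by name. The sketch below shows two ways this might look through the REST
API: analyzing a sample string with the `smartcn` analyzer, and creating an
index whose custom analyzer is built on `smartcn_tokenizer`. The node address,
the index name `smartcn_example`, the analyzer name `my_smartcn`, the function
names, and the sample text are all assumptions made for illustration.

[source,sh]
----------------------------------------------------------------
# Assumed node address; override by exporting ES_URL.
ES_URL="${ES_URL:-http://localhost:9200}"

# Body for the _analyze API: run the smartcn analyzer on sample text.
ANALYZE_BODY='{"analyzer": "smartcn", "text": "我爱北京天安门"}'

# Index settings defining a custom analyzer (hypothetical name my_smartcn)
# on top of smartcn_tokenizer.
INDEX_BODY='{
  "settings": {
    "analysis": {
      "analyzer": {
        "my_smartcn": {
          "type": "custom",
          "tokenizer": "smartcn_tokenizer"
        }
      }
    }
  }
}'

# Send the sample text through the smartcn analyzer.
analyze_smartcn() {
  curl -s -X POST "$ES_URL/_analyze" \
       -H 'Content-Type: application/json' -d "$ANALYZE_BODY"
}

# Create the hypothetical smartcn_example index with the custom analyzer.
create_smartcn_index() {
  curl -s -X PUT "$ES_URL/smartcn_example" \
       -H 'Content-Type: application/json' -d "$INDEX_BODY"
}

# Usage, against a running cluster with the plugin installed:
#   analyze_smartcn
#   create_smartcn_index
----------------------------------------------------------------

Wrapping the requests in functions keeps the block free of side effects until
one of them is called explicitly.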