[[analysis-lang-analyzer]]
=== Language Analyzers

A set of analyzers for analyzing text in a specific language. The
following types are supported: `arabic`, `armenian`, `basque`,
`brazilian`, `bulgarian`, `catalan`, `chinese`, `cjk`, `czech`,
`danish`, `dutch`, `english`, `finnish`, `french`, `galician`, `german`,
`greek`, `hindi`, `hungarian`, `indonesian`, `italian`, `norwegian`,
`persian`, `portuguese`, `romanian`, `russian`, `spanish`, `swedish`,
`turkish`, `thai`.
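
As a minimal sketch of selecting one of these types, the Python snippet below builds an index settings body that defines a single language analyzer. The `settings.analysis.analyzer` layout, the helper `language_analyzer_settings` and the analyzer name `my_french` are assumptions made for illustration; only the type names come from the list above.

```python
import json

# Analyzer types taken from the list above.
LANGUAGE_ANALYZERS = {
    "arabic", "armenian", "basque", "brazilian", "bulgarian", "catalan",
    "chinese", "cjk", "czech", "danish", "dutch", "english", "finnish",
    "french", "galician", "german", "greek", "hindi", "hungarian",
    "indonesian", "italian", "norwegian", "persian", "portuguese",
    "romanian", "russian", "spanish", "swedish", "turkish", "thai",
}


def language_analyzer_settings(name, language):
    """Return a settings body defining analyzer `name` of type `language`.

    Hypothetical helper; the nesting of the returned dict is an assumption.
    """
    if language not in LANGUAGE_ANALYZERS:
        raise ValueError(f"unsupported language analyzer: {language}")
    return {"settings": {"analysis": {"analyzer": {name: {"type": language}}}}}


settings = language_analyzer_settings("my_french", "french")
print(json.dumps(settings, indent=2))
```

Checking the type against the documented list before sending the settings makes a misspelled language name fail early on the client side.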

All analyzers support setting custom `stopwords`, either inline in
the config or from an external stopwords file by setting
`stopwords_path`. See <<analysis-stop-analyzer,Stop Analyzer>> for
more details.
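
The two ways of supplying stopwords could be sketched as follows. The helper `with_stopwords`, the example word list and the file path `stopwords/english.txt` are hypothetical; only the `stopwords` and `stopwords_path` setting names come from the text. Accepting exactly one of the two options is a choice made for this sketch, not a documented rule.

```python
import json


def with_stopwords(language, stopwords=None, stopwords_path=None):
    """Build a language analyzer definition with custom stopwords.

    Hypothetical helper: takes either an inline list or a file path.
    """
    if (stopwords is None) == (stopwords_path is None):
        raise ValueError("give exactly one of stopwords or stopwords_path")
    analyzer = {"type": language}
    if stopwords is not None:
        analyzer["stopwords"] = list(stopwords)      # inline in the config
    else:
        analyzer["stopwords_path"] = stopwords_path  # external file
    return analyzer


inline = with_stopwords("english", stopwords=["a", "an", "the"])
external = with_stopwords("english", stopwords_path="stopwords/english.txt")
print(json.dumps({"inline": inline, "external": external}, indent=2))
```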

The following analyzers support setting a custom `stem_exclusion` list
(words that should not be stemmed):
`arabic`, `armenian`, `basque`, `brazilian`, `bulgarian`, `catalan`,
`czech`, `danish`, `dutch`, `english`, `finnish`, `french`, `galician`,
`german`, `hindi`, `hungarian`, `indonesian`, `italian`, `norwegian`,
`portuguese`, `romanian`, `russian`, `spanish`, `swedish`, `turkish`.
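
A short sketch of guarding `stem_exclusion` against the analyzers that do not accept it is shown below. The helper `with_stem_exclusion` and the example words are hypothetical; the set of supporting analyzers is copied from the list above.

```python
# Analyzers listed above as supporting `stem_exclusion`.
STEM_EXCLUSION_ANALYZERS = {
    "arabic", "armenian", "basque", "brazilian", "bulgarian", "catalan",
    "czech", "danish", "dutch", "english", "finnish", "french", "galician",
    "german", "hindi", "hungarian", "indonesian", "italian", "norwegian",
    "portuguese", "romanian", "russian", "spanish", "swedish", "turkish",
}


def with_stem_exclusion(language, words):
    """Build an analyzer definition whose `stem_exclusion` holds `words`.

    Hypothetical helper; rejects analyzers not in the documented list.
    """
    if language not in STEM_EXCLUSION_ANALYZERS:
        raise ValueError(f"{language} does not support stem_exclusion")
    return {"type": language, "stem_exclusion": list(words)}


analyzer = with_stem_exclusion("english", ["skies", "organization"])
print(analyzer)
```

Under this check, a request such as `with_stem_exclusion("thai", [...])` raises an error, since `thai` is absent from the supporting list.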