custom-analyzer.asciidoc 1.7 KB

12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152
  1. [[analysis-custom-analyzer]]
  2. === Custom Analyzer
  3. An analyzer of type `custom` that allows to combine a `Tokenizer` with
  4. zero or more `Token Filters`, and zero or more `Char Filters`. The
  5. custom analyzer accepts a logical/registered name of the tokenizer to
  6. use, and a list of logical/registered names of token filters.
  7. The following are settings that can be set for a `custom` analyzer type:
  8. [cols="<,<",options="header",]
  9. |=======================================================================
  10. |Setting |Description
  11. |`tokenizer` |The logical / registered name of the tokenizer to use.
  12. |`filter` |An optional list of logical / registered name of token
  13. filters.
  14. |`char_filter` |An optional list of logical / registered name of char
  15. filters.
  16. |=======================================================================
  17. Here is an example:
  18. [source,js]
  19. --------------------------------------------------
  20. index :
  21. analysis :
  22. analyzer :
  23. myAnalyzer2 :
  24. type : custom
  25. tokenizer : myTokenizer1
  26. filter : [myTokenFilter1, myTokenFilter2]
  27. char_filter : [my_html]
  28. tokenizer :
  29. myTokenizer1 :
  30. type : standard
  31. max_token_length : 900
  32. filter :
  33. myTokenFilter1 :
  34. type : stop
  35. stopwords : [stop1, stop2, stop3, stop4]
  36. myTokenFilter2 :
  37. type : length
  38. min : 0
  39. max : 2000
  40. char_filter :
  41. my_html :
  42. type : html_strip
  43. escaped_tags : [xxx, yyy]
  44. read_ahead : 1024
  45. --------------------------------------------------