[[analysis]]
= Text analysis

:lucene-analysis-docs: https://lucene.apache.org/core/{lucene_version_path}/analysis/common/org/apache/lucene/analysis
:lucene-gh-main-link: https://github.com/apache/lucene/blob/main/lucene
:lucene-stop-word-link: {lucene-gh-main-link}/analysis/common/src/resources/org/apache/lucene/analysis

[partintro]
--

_Text analysis_ is the process of converting unstructured text, like
the body of an email or a product description, into a structured format that's
optimized for search.
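
As a rough illustration of that conversion, the Python sketch below lowercases a
string, splits it into word tokens, and drops a few common words. It is a toy,
not how {es} implements analysis: the function name `toy_analyze`, the regular
expression, and the stop word set are all assumptions made for this example.

```python
import re

# Hypothetical stop word set chosen for this example only.
STOP_WORDS = {"a", "an", "the", "of", "and"}


def toy_analyze(text):
    """Return lowercase word tokens from text, skipping stop words."""
    # Lowercase first, then keep runs of ASCII letters and digits as tokens.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [token for token in tokens if token not in STOP_WORDS]


print(toy_analyze("The QUICK brown fox, and the lazy dog!"))
# prints ['quick', 'brown', 'fox', 'lazy', 'dog']
```

The resulting token list is the kind of structured form a search engine can
match queries against, regardless of the original casing or punctuation.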

[discrete]
[[when-to-configure-analysis]]
=== When to configure text analysis

{es} performs text analysis when indexing or searching <<text,`text`>> fields.

If your index doesn't contain `text` fields, no further setup is needed; you can
skip the pages in this section.

However, if you use `text` fields or your text searches aren't returning results
as expected, configuring text analysis can often help. You should also look into
analysis configuration if you're using {es} to:

* Build a search engine
* Mine unstructured data
* Fine-tune search for a specific language
* Perform lexicographic or linguistic research
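
One way to check whether an index declares any `text` fields is to walk its
mapping. The sketch below assumes the mapping is already available as a Python
dictionary in the usual `properties` layout; the helper name `find_text_fields`
and the sample mapping are hypothetical.

```python
def find_text_fields(mapping, prefix=""):
    """Return dotted paths of every field whose type is "text"."""
    found = []
    for name, spec in mapping.get("properties", {}).items():
        path = prefix + name
        if spec.get("type") == "text":
            found.append(path)
        # Multi-fields are declared under "fields" on the parent field.
        for sub_name, sub_spec in spec.get("fields", {}).items():
            if sub_spec.get("type") == "text":
                found.append(path + "." + sub_name)
        # Object fields nest their children under their own "properties".
        found.extend(find_text_fields(spec, path + "."))
    return found


# Hypothetical mapping used only to exercise the helper.
sample_mapping = {
    "properties": {
        "title": {"type": "text"},
        "sku": {"type": "keyword"},
        "author": {"properties": {"bio": {"type": "text"}}},
    }
}

print(find_text_fields(sample_mapping))
# prints ['title', 'author.bio']
```

An empty result suggests the pages in this section can be skipped for that
index.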

[discrete]
[[analysis-toc]]
=== In this section

* <<analysis-overview>>
* <<analysis-concepts>>
* <<configure-text-analysis>>
* <<analysis-analyzers>>
* <<analysis-tokenizers>>
* <<analysis-tokenfilters>>
* <<analysis-charfilters>>
* <<analysis-normalizers>>
--

include::analysis/overview.asciidoc[]

include::analysis/concepts.asciidoc[]

include::analysis/configure-text-analysis.asciidoc[]

include::analysis/analyzers.asciidoc[]

include::analysis/tokenizers.asciidoc[]

include::analysis/tokenfilters.asciidoc[]

include::analysis/charfilters.asciidoc[]

include::analysis/normalizers.asciidoc[]