[[analysis]]
= Text analysis
:lucene-analysis-docs: https://lucene.apache.org/core/{lucene_version_path}/analyzers-common/org/apache/lucene/analysis

[partintro]
--

_Text analysis_ is the process of converting unstructured text, like
the body of an email or a product description, into a structured format that's
optimized for search.

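As a rough illustration of the idea, the following Python sketch lowercases
some text, splits it into tokens, and records which documents contain each
token. It is a simplified toy, not how {es} implements analysis; the function
names `analyze` and `build_inverted_index` and the sample documents are
assumptions made up for this example.

```python
import re
from collections import defaultdict


def analyze(text):
    """Toy analyzer: lowercase the text, then extract alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(docs):
    """Map each token to the sorted IDs of the documents that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in analyze(text):
            index[token].add(doc_id)
    return {token: sorted(ids) for token, ids in index.items()}


# Hypothetical sample documents, keyed by document ID.
docs = {
    1: "The QUICK brown fox!",
    2: "Quick product description: a brown leather bag.",
}

index = build_inverted_index(docs)
tokens = analyze("The QUICK brown fox!")
```

Because both the stored text and a query would pass through the same
`analyze` step, a search for `quick` can match a document that contains
`QUICK`, which is the kind of structure that makes text searchable.
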
[float]
[[when-to-configure-analysis]]
=== When to configure text analysis

{es} performs text analysis when indexing or searching <<text,`text`>> fields.

If your index doesn't contain `text` fields, no further setup is needed; you can
skip the pages in this section.

However, if you use `text` fields or your text searches aren't returning results
as expected, configuring text analysis can often help. You should also look into
analysis configuration if you're using {es} to:

* Build a search engine
* Mine unstructured data
* Fine-tune search for a specific language
* Perform lexicographic or linguistic research

[float]
[[analysis-toc]]
=== In this section

* <<analysis-overview>>
* <<analysis-concepts>>
* <<configure-text-analysis>>
* <<analysis-analyzers>>
* <<analysis-tokenizers>>
* <<analysis-tokenfilters>>
* <<analysis-charfilters>>
* <<analysis-normalizers>>

--

include::analysis/overview.asciidoc[]

include::analysis/concepts.asciidoc[]

include::analysis/configure-text-analysis.asciidoc[]

include::analysis/analyzers.asciidoc[]

include::analysis/tokenizers.asciidoc[]

include::analysis/tokenfilters.asciidoc[]

include::analysis/charfilters.asciidoc[]

include::analysis/normalizers.asciidoc[]