|
@@ -1,11 +1,11 @@
|
|
|
[[analysis-simple-analyzer]]
|
|
|
=== Simple Analyzer
|
|
|
|
|
|
-The `simple` analyzer breaks text into terms whenever it encounters a
|
|
|
-character which is not a letter. All terms are lower cased.
|
|
|
+The `simple` analyzer breaks text into terms at any non-letter character, such
|
|
|
+as numbers, spaces, hyphens and apostrophes, discards non-letter characters,
|
|
|
+and changes uppercase to lowercase.
|
|
|
|
|
|
-[float]
|
|
|
-=== Example output
|
|
|
+==== Example
|
|
|
|
|
|
[source,js]
|
|
|
---------------------------
|
|
@@ -108,30 +108,30 @@ POST _analyze
|
|
|
/////////////////////
|
|
|
|
|
|
|
|
|
-The above sentence would produce the following terms:
|
|
|
+The `simple` analyzer parses the sentence and produces the following
|
|
|
+terms:
|
|
|
|
|
|
[source,text]
|
|
|
---------------------------
|
|
|
[ the, quick, brown, foxes, jumped, over, the, lazy, dog, s, bone ]
|
|
|
---------------------------
|
|
|
|
|
|
-[float]
|
|
|
-=== Configuration
|
|
|
+==== Configure parameters
|
|
|
|
|
|
-The `simple` analyzer is not configurable.
|
|
|
+The `simple` analyzer does not contain configurable parameters.
|
|
|
|
|
|
-[float]
|
|
|
-=== Definition
|
|
|
+==== Customize
|
|
|
|
|
|
-The `simple` analzyer consists of:
|
|
|
+The `simple` analyzer is defined by one tokenizer:
|
|
|
|
|
|
Tokenizer::
|
|
|
* <<analysis-lowercase-tokenizer,Lower Case Tokenizer>>
|
|
|
|
|
|
-If you need to customize the `simple` analyzer then you need to recreate
|
|
|
-it as a `custom` analyzer and modify it, usually by adding token filters.
|
|
|
-This would recreate the built-in `simple` analyzer and you can use it as
|
|
|
-a starting point for further customization:
|
|
|
+To customize the `simple` analyzer, duplicate it to create the basis for
|
|
|
+a `custom` analyzer. The new analyzer can be modified as required, usually by
|
|
|
+adding token filters.
|
|
|
+
|
|
|
+===== Example
|
|
|
|
|
|
[source,js]
|
|
|
----------------------------------------------------
|
|
@@ -152,4 +152,4 @@ PUT /simple_example
|
|
|
----------------------------------------------------
|
|
|
// CONSOLE
|
|
|
// TEST[s/\n$/\nstartyaml\n - compare_analyzers: {index: simple_example, first: simple, second: rebuilt_simple}\nendyaml\n/]
|
|
|
-<1> You'd add any token filters here.
|
|
|
+<1> Add token filters here.
|