1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071727374757677787980818283848586878889909192939495 |
- [[configuring-analyzers]]
- === Configuring built-in analyzers
- The built-in analyzers can be used directly without any configuration. Some
- of them, however, support configuration options to alter their behaviour. For
- instance, the <<analysis-standard-analyzer,`standard` analyzer>> can be configured
- to support a list of stop words:
- [source,js]
- --------------------------------
- PUT my_index
- {
- "settings": {
- "analysis": {
- "analyzer": {
- "std_english": { <1>
- "type": "standard",
- "stopwords": "_english_"
- }
- }
- }
- },
- "mappings": {
- "properties": {
- "my_text": {
- "type": "text",
- "analyzer": "standard", <2>
- "fields": {
- "english": {
- "type": "text",
- "analyzer": "std_english" <3>
- }
- }
- }
- }
- }
- }
- POST my_index/_analyze
- {
- "field": "my_text", <2>
- "text": "The old brown cow"
- }
- POST my_index/_analyze
- {
- "field": "my_text.english", <3>
- "text": "The old brown cow"
- }
- --------------------------------
- // CONSOLE
- <1> We define the `std_english` analyzer to be based on the `standard`
- analyzer, but configured to remove the pre-defined list of English stopwords.
- <2> The `my_text` field uses the `standard` analyzer directly, without
- any configuration. No stop words will be removed from this field.
- The resulting terms are: `[ the, old, brown, cow ]`
- <3> The `my_text.english` field uses the `std_english` analyzer, so
- English stop words will be removed. The resulting terms are:
- `[ old, brown, cow ]`
- /////////////////////
- [source,console-result]
- ----------------------------
- {
- "tokens": [
- {
- "token": "old",
- "start_offset": 4,
- "end_offset": 7,
- "type": "<ALPHANUM>",
- "position": 1
- },
- {
- "token": "brown",
- "start_offset": 8,
- "end_offset": 13,
- "type": "<ALPHANUM>",
- "position": 2
- },
- {
- "token": "cow",
- "start_offset": 14,
- "end_offset": 17,
- "type": "<ALPHANUM>",
- "position": 3
- }
- ]
- }
- ----------------------------
- /////////////////////
|