| 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114 | [[analysis-whitespace-tokenizer]]=== Whitespace AnalyzerThe `whitespace` tokenizer breaks text into terms whenever it encounters awhitespace character.[float]=== Example output[source,js]---------------------------POST _analyze{  "tokenizer": "whitespace",  "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."}---------------------------// CONSOLE/////////////////////[source,js]----------------------------{  "tokens": [    {      "token": "The",      "start_offset": 0,      "end_offset": 3,      "type": "word",      "position": 0    },    {      "token": "2",      "start_offset": 4,      "end_offset": 5,      "type": "word",      "position": 1    },    {      "token": "QUICK",      "start_offset": 6,      "end_offset": 11,      "type": "word",      "position": 2    },    {      "token": "Brown-Foxes",      "start_offset": 12,      "end_offset": 23,      "type": "word",      "position": 3    },    {      "token": "jumped",      "start_offset": 24,      "end_offset": 30,      "type": "word",      "position": 4    },    {      "token": "over",      "start_offset": 31,      "end_offset": 35,      "type": "word",      "position": 5    },    {      "token": "the",      "start_offset": 36,      "end_offset": 39,      "type": "word",      "position": 6    },    {      "token": "lazy",      "start_offset": 40,      "end_offset": 44,      "type": "word",      "position": 7    },    {      "token": "dog's",      "start_offset": 45,      "end_offset": 50,      "type": "word",      "position": 8    },    {      "token": "bone.",      "start_offset": 51,      "end_offset": 56,      "type": "word",      "position": 9    }  ]}----------------------------// TESTRESPONSE/////////////////////The above sentence would produce the following terms:[source,text]---------------------------[ The, 2, QUICK, Brown-Foxes, jumped, over, the, lazy, dog's, bone. ]---------------------------[float]=== ConfigurationThe `whitespace` tokenizer is not configurable.
 |