[[query-dsl-simple-query-string-query]]
=== Simple Query String Query

A query that uses the SimpleQueryParser to parse its query string. Unlike the
regular `query_string` query, the `simple_query_string` query never
throws an exception; it discards invalid parts of the query instead. Here is
an example:

[source,js]
--------------------------------------------------
GET /_search
{
  "query": {
    "simple_query_string" : {
        "query": "\"fried eggs\" +(eggplant | potato) -frittata",
        "analyzer": "snowball",
        "fields": ["body^5","_all"],
        "default_operator": "and"
    }
  }
}
--------------------------------------------------
// CONSOLE

The `simple_query_string` top-level parameters include:

[cols="<,<",options="header",]
|=======================================================================
|Parameter |Description
|`query` |The actual query to be parsed. See below for syntax.

|`fields` |The fields to perform the parsed query against. Defaults to the
`index.query.default_field` index setting, which in turn defaults to `_all`.

|`default_operator` |The default operator used if no explicit operator
is specified. For example, with a default operator of `OR`, the query
`capital of Hungary` is translated to `capital OR of OR Hungary`, and
with a default operator of `AND`, the same query is translated to
`capital AND of AND Hungary`. The default value is `OR`.

|`analyzer` |The analyzer used to analyze each term of the query when
creating composite queries.

|`flags` |Flags specifying which features of the `simple_query_string` to
enable. Defaults to `ALL`.

|`analyze_wildcard` | Whether terms of prefix queries should be automatically
analyzed or not. If `true`, a best effort will be made to analyze the prefix. However,
some analyzers will not be able to provide meaningful results
based just on the prefix of a term. Defaults to `false`.

|`lenient` | If set to `true`, format-based failures
(like providing text to a numeric field) are ignored.

|`minimum_should_match` | The minimum number of clauses that must match for a
document to be returned. See the
<<query-dsl-minimum-should-match,`minimum_should_match`>> documentation for the
full list of options.

|`quote_field_suffix` | A suffix to append to fields for quoted parts of
the query string. This makes it possible to use a field that has a different analysis chain
for exact matching. See <<mixing-exact-search-with-stemming,here>> for a
comprehensive example.

|`auto_generate_synonyms_phrase_query` |Whether phrase queries should be automatically generated for multi-term synonyms.
Defaults to `true`.

|`all_fields` | Perform the query on all fields detected in the mapping that can
be queried. Used by default when the `_all` field is disabled, no
`default_field` is specified in the index settings, and no `fields` are specified.
|=======================================================================

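
The `default_operator` row can be illustrated with a small sketch. This is a toy model of the translation described in the table, not the parser's implementation; the helper name `apply_default_operator` is hypothetical and it only handles plain whitespace-separated terms.

```python
def apply_default_operator(query: str, default_operator: str = "OR") -> str:
    """Join whitespace-separated terms with the default operator.

    Toy model only: it ignores quotes, parentheses and explicit operators.
    """
    op = default_operator.upper()
    if op not in ("OR", "AND"):
        raise ValueError("default_operator must be OR or AND, got %r" % default_operator)
    return (" %s " % op).join(query.split())


print(apply_default_operator("capital of Hungary"))         # capital OR of OR Hungary
print(apply_default_operator("capital of Hungary", "and"))  # capital AND of AND Hungary
```
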
[float]
===== Simple Query String Syntax

The `simple_query_string` supports the following special characters:

* `+` signifies AND operation
* `|` signifies OR operation
* `-` negates a single token
* `"` wraps a number of tokens to signify a phrase for searching
* `*` at the end of a term signifies a prefix query
* `(` and `)` signify precedence
* `~N` after a word signifies edit distance (fuzziness)
* `~N` after a phrase signifies slop amount

To search for any of these special characters literally, escape them with `\`.

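
When user input is passed through as a query, the escaping rule above can be applied on the client side. The following is a minimal sketch; `escape_simple_query_string` is a hypothetical helper, and escaping the backslash itself is an assumption of this sketch rather than something stated above.

```python
# Operator characters from the syntax list, plus the escape character itself
# (including the backslash is an assumption of this sketch).
SPECIAL_CHARS = set('+|-"*()~\\')


def escape_simple_query_string(text: str) -> str:
    """Prefix every special character with a backslash."""
    return "".join("\\" + ch if ch in SPECIAL_CHARS else ch for ch in text)


print(escape_simple_query_string("C++ (beta)"))  # C\+\+ \(beta\)
```

If the escaped string is then embedded in a JSON request body, the backslashes have to be escaped again for JSON; a JSON serializer such as `json.dumps` takes care of that.
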
Be aware that this syntax may behave differently depending on the
`default_operator` value. For example, consider the following query:

[source,js]
--------------------------------------------------
GET /_search
{
    "query": {
        "simple_query_string" : {
            "fields" : ["content"],
            "query" : "foo bar -baz"
        }
    }
}
--------------------------------------------------
// CONSOLE

You may expect that only documents containing "foo" or "bar" will be returned,
as long as they do not contain "baz". However, because the `default_operator`
is `OR`, the query really matches documents that contain "foo", documents
that contain "bar", or documents that don't contain "baz". If this is unintended,
the query can be changed to `"foo bar +-baz"`, which will not return
documents that contain "baz".

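
The difference can be made concrete with a toy boolean model over sets of terms. This is only an illustration of the semantics described above, not how the parser evaluates queries; both function names are hypothetical, and the second one assumes that `+-baz` turns the negation into a required clause, so that the query behaves like `(foo OR bar) AND NOT baz`.

```python
def matches_default_or(doc_terms):
    """`foo bar -baz` with default_operator OR: foo OR bar OR (NOT baz)."""
    return "foo" in doc_terms or "bar" in doc_terms or "baz" not in doc_terms


def matches_required_negation(doc_terms):
    """`foo bar +-baz`, assumed to mean (foo OR bar) AND (NOT baz)."""
    return ("foo" in doc_terms or "bar" in doc_terms) and "baz" not in doc_terms


docs = {
    "only_foo": {"foo"},
    "foo_and_baz": {"foo", "baz"},
    "unrelated": {"qux"},
}
for name, terms in docs.items():
    print(name, matches_default_or(terms), matches_required_negation(terms))
```

In this model the `unrelated` document matches the first query simply because it does not contain "baz", and `foo_and_baz` matches because it contains "foo"; neither matches the second query.
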
[float]
==== Default Field

When the field to search on is not explicitly specified in the query
string syntax, `index.query.default_field` is used to derive
which field to search on. It defaults to the `_all` field.

If the `_all` field is disabled and no `fields` are specified in the request,
the `simple_query_string` query will automatically attempt to determine the
existing fields in the index's mapping that are queryable, and perform the
search on those fields.

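
The fallback order described in this section can be sketched as a small function. This is one reading of the text above, not the actual resolution code; the function name `resolve_search_fields` and all of its parameters are hypothetical.

```python
def resolve_search_fields(request_fields, default_field_setting,
                          all_field_enabled, queryable_fields):
    """Pick the fields to search, following the order described above."""
    if request_fields:
        # Fields given in the request always win.
        return list(request_fields)
    if default_field_setting:
        # An explicit index.query.default_field value.
        return [default_field_setting]
    if all_field_enabled:
        # The setting is unset, so it falls back to `_all`.
        return ["_all"]
    # `_all` is disabled: search every queryable field in the mapping.
    return list(queryable_fields)


print(resolve_search_fields(None, None, False, ["title", "body"]))  # ['title', 'body']
```
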
[float]
==== Multi Field

The `fields` parameter can also include pattern-based field names,
which are automatically expanded to the relevant fields (dynamically
introduced fields included). For example:

[source,js]
--------------------------------------------------
GET /_search
{
    "query": {
        "simple_query_string" : {
            "fields" : ["content", "name.*^5"],
            "query" : "foo bar baz"
        }
    }
}
--------------------------------------------------
// CONSOLE

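
To see what such a pattern expands to, the following sketch matches field specifications against a list of mapped field names. It is a toy model: `expand_fields` and the sample field names are hypothetical, and using Python's `fnmatch` for the `*` wildcard is an assumption of the sketch, not the matching code the query uses.

```python
from fnmatch import fnmatchcase


def expand_fields(field_specs, mapped_fields):
    """Expand specs like "name.*^5" into a {field: boost} dict."""
    expanded = {}
    for spec in field_specs:
        # Split an optional "^boost" suffix off the field name or pattern.
        pattern, _, boost = spec.partition("^")
        weight = float(boost) if boost else 1.0
        for field in mapped_fields:
            if fnmatchcase(field, pattern):
                expanded[field] = weight
    return expanded


mapped = ["content", "name.first", "name.last", "title"]
print(expand_fields(["content", "name.*^5"], mapped))
# {'content': 1.0, 'name.first': 5.0, 'name.last': 5.0}
```
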
[float]
==== Flags

`simple_query_string` supports multiple flags to specify which parsing features
should be enabled. They are specified as a `|`-delimited string with the
`flags` parameter:

[source,js]
--------------------------------------------------
GET /_search
{
    "query": {
        "simple_query_string" : {
            "query" : "foo | bar + baz*",
            "flags" : "OR|AND|PREFIX"
        }
    }
}
--------------------------------------------------
// CONSOLE

The available flags are: `ALL`, `NONE`, `AND`, `OR`, `NOT`, `PREFIX`, `PHRASE`,
`PRECEDENCE`, `ESCAPE`, `WHITESPACE`, `FUZZY`, `NEAR`, and `SLOP`.

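
A client can build the `flags` string from a list and reject misspelled names before sending the request. The sketch below assumes the flag names listed above; `build_flags` is a hypothetical helper, not part of any client library.

```python
import json

AVAILABLE_FLAGS = {
    "ALL", "NONE", "AND", "OR", "NOT", "PREFIX", "PHRASE",
    "PRECEDENCE", "ESCAPE", "WHITESPACE", "FUZZY", "NEAR", "SLOP",
}


def build_flags(*flags):
    """Validate flag names and join them into a `|`-delimited string."""
    normalized = [flag.upper() for flag in flags]
    unknown = [flag for flag in normalized if flag not in AVAILABLE_FLAGS]
    if unknown:
        raise ValueError("unknown flags: %s" % ", ".join(unknown))
    return "|".join(normalized)


body = {
    "query": {
        "simple_query_string": {
            "query": "foo | bar + baz*",
            "flags": build_flags("OR", "AND", "PREFIX"),
        }
    }
}
print(json.dumps(body, indent=2))
```
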
[float]
==== Synonyms

The `simple_query_string` query supports multi-term synonym expansion with the
<<analysis-synonym-graph-tokenfilter,synonym_graph>> token filter. When this filter is used, the parser creates a phrase query for each multi-term synonym.
For example, the synonym rule `ny, new york` would produce:

`(ny OR ("new york"))`

It is also possible to match multi-term synonyms with conjunctions instead:

[source,js]
--------------------------------------------------
GET /_search
{
   "query": {
       "simple_query_string" : {
           "query" : "ny city",
           "auto_generate_synonyms_phrase_query" : false
       }
   }
}
--------------------------------------------------
// CONSOLE

The example above creates a boolean query:

`(ny OR (new AND york)) city`

that matches documents with the term `ny` or the conjunction `new AND york`.
By default the parameter `auto_generate_synonyms_phrase_query` is set to `true`.
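
The two expansions can be reproduced with a short string-building sketch. It only mimics the notation used in this section and says nothing about how the parser builds queries internally; `expand_synonym` is a hypothetical helper.

```python
def expand_synonym(variants, phrase_query=True):
    """Render a synonym group as a boolean expression string.

    Multi-term variants become a phrase when phrase_query is True,
    otherwise a conjunction of their terms.
    """
    parts = []
    for variant in variants:
        terms = variant.split()
        if len(terms) == 1:
            parts.append(terms[0])
        elif phrase_query:
            parts.append('("' + " ".join(terms) + '")')
        else:
            parts.append("(" + " AND ".join(terms) + ")")
    return "(" + " OR ".join(parts) + ")"


print(expand_synonym(["ny", "new york"]))                      # (ny OR ("new york"))
print(expand_synonym(["ny", "new york"], phrase_query=False))  # (ny OR (new AND york))
```
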