query-string-query.asciidoc 8.8 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276
  1. [[query-dsl-query-string-query]]
  2. === Query String Query
  3. A query that uses a query parser in order to parse its content. Here is
  4. an example:
  5. [source,js]
  6. --------------------------------------------------
  7. GET /_search
  8. {
  9. "query": {
  10. "query_string" : {
  11. "default_field" : "content",
  12. "query" : "this AND that OR thus"
  13. }
  14. }
  15. }
  16. --------------------------------------------------
  17. // CONSOLE
  18. The `query_string` query parses the input and splits text around logical operators.
  19. Each textual part is analyzed independently of each other. For instance the following query:
  20. [source,js]
  21. --------------------------------------------------
  22. GET /_search
  23. {
  24. "query": {
  25. "query_string" : {
  26. "default_field" : "content",
  27. "query" : "(new york) AND ny"
  28. }
  29. }
  30. }
  31. --------------------------------------------------
  32. // CONSOLE
  33. ... will be splitted in `new york` and `ny` and each part is analyzed independently.
  34. When multiple fields are provided it is also possible to modify how the different
  35. field queries are combined inside each textual part using the `type` parameter.
  36. The possible modes are described <<multi-match-types, here>> and the default is `best_fields`.
  37. The `query_string` top level parameters include:
  38. [cols="<,<",options="header",]
  39. |=======================================================================
  40. |Parameter |Description
  41. |`query` |The actual query to be parsed. See <<query-string-syntax>>.
  42. |`default_field` |The default field for query terms if no prefix field
  43. is specified. Defaults to the `index.query.default_field` index
  44. settings, which in turn defaults to `*`.
  45. `*` extracts all fields in the mapping that are eligible to term queries
  46. and filters the metadata fields. All extracted fields are then combined
  47. to build a query when no prefix field is provided.
  48. |`default_operator` |The default operator used if no explicit operator
  49. is specified. For example, with a default operator of `OR`, the query
  50. `capital of Hungary` is translated to `capital OR of OR Hungary`, and
  51. with default operator of `AND`, the same query is translated to
  52. `capital AND of AND Hungary`. The default value is `OR`.
  53. |`analyzer` |The analyzer name used to analyze the query string.
  54. |`allow_leading_wildcard` |When set, `*` or `?` are allowed as the first
  55. character. Defaults to `true`.
  56. |`enable_position_increments` |Set to `true` to enable position
  57. increments in result queries. Defaults to `true`.
  58. |`fuzzy_max_expansions` |Controls the number of terms fuzzy queries will
  59. expand to. Defaults to `50`
  60. |`fuzziness` |Set the fuzziness for fuzzy queries. Defaults
  61. to `AUTO`. See <<fuzziness>> for allowed settings.
  62. |`fuzzy_prefix_length` |Set the prefix length for fuzzy queries. Default
  63. is `0`.
  64. |`phrase_slop` |Sets the default slop for phrases. If zero, then exact
  65. phrase matches are required. Default value is `0`.
  66. |`boost` |Sets the boost value of the query. Defaults to `1.0`.
  67. |`auto_generate_phrase_queries` |Defaults to `false`.
  68. |`analyze_wildcard` |By default, wildcards terms in a query string are
  69. not analyzed. By setting this value to `true`, a best effort will be
  70. made to analyze those as well.
  71. |`max_determinized_states` |Limit on how many automaton states regexp
  72. queries are allowed to create. This protects against too-difficult
  73. (e.g. exponentially hard) regexps. Defaults to 10000.
  74. |`minimum_should_match` |A value controlling how many "should" clauses
  75. in the resulting boolean query should match. It can be an absolute value
  76. (`2`), a percentage (`30%`) or a
  77. <<query-dsl-minimum-should-match,combination of
  78. both>>.
  79. |`lenient` |If set to `true` will cause format based failures (like
  80. providing text to a numeric field) to be ignored.
  81. |`time_zone` | Time Zone to be applied to any range query related to dates. See also
  82. http://www.joda.org/joda-time/apidocs/org/joda/time/DateTimeZone.html[JODA timezone].
  83. |`quote_field_suffix` | A suffix to append to fields for quoted parts of
  84. the query string. This allows to use a field that has a different analysis chain
  85. for exact matching. Look <<mixing-exact-search-with-stemming,here>> for a
  86. comprehensive example.
  87. |`all_fields` | deprecated[6.0.0, set `default_field` to `*` instead]
  88. Perform the query on all fields detected in the mapping that can
  89. be queried. Will be used by default when the `_all` field is disabled and no
  90. `default_field` is specified (either in the index settings or in the request
  91. body) and no `fields` are specified.
  92. |=======================================================================
  93. When a multi term query is being generated, one can control how it gets
  94. rewritten using the
  95. <<query-dsl-multi-term-rewrite,rewrite>>
  96. parameter.
  97. [float]
  98. ==== Default Field
  99. When not explicitly specifying the field to search on in the query
  100. string syntax, the `index.query.default_field` will be used to derive
  101. which field to search on. If the `index.query.default_field` is not specified,
  102. the `query_string` will automatically attempt to determine the existing fields in the index's
  103. mapping that are queryable, and perform the search on those fields. Note that this will not
  104. include nested documents, use a nested query to search those documents.
  105. [float]
  106. ==== Multi Field
  107. The `query_string` query can also run against multiple fields. Fields can be
  108. provided via the `"fields"` parameter (example below).
  109. The idea of running the `query_string` query against multiple fields is to
  110. expand each query term to an OR clause like this:
  111. field1:query_term OR field2:query_term | ...
  112. For example, the following query
  113. [source,js]
  114. --------------------------------------------------
  115. GET /_search
  116. {
  117. "query": {
  118. "query_string" : {
  119. "fields" : ["content", "name"],
  120. "query" : "this AND that"
  121. }
  122. }
  123. }
  124. --------------------------------------------------
  125. // CONSOLE
  126. matches the same words as
  127. [source,js]
  128. --------------------------------------------------
  129. GET /_search
  130. {
  131. "query": {
  132. "query_string": {
  133. "query": "(content:this OR name:this) AND (content:that OR name:that)"
  134. }
  135. }
  136. }
  137. --------------------------------------------------
  138. // CONSOLE
  139. Since several queries are generated from the individual search terms,
  140. combining them is automatically done using a `dis_max` query with a tie_breaker.
  141. For example (the `name` is boosted by 5 using `^5` notation):
  142. [source,js]
  143. --------------------------------------------------
  144. GET /_search
  145. {
  146. "query": {
  147. "query_string" : {
  148. "fields" : ["content", "name^5"],
  149. "query" : "this AND that OR thus",
  150. "tie_breaker" : 0
  151. }
  152. }
  153. }
  154. --------------------------------------------------
  155. // CONSOLE
  156. Simple wildcard can also be used to search "within" specific inner
  157. elements of the document. For example, if we have a `city` object with
  158. several fields (or inner object with fields) in it, we can automatically
  159. search on all "city" fields:
  160. [source,js]
  161. --------------------------------------------------
  162. GET /_search
  163. {
  164. "query": {
  165. "query_string" : {
  166. "fields" : ["city.*"],
  167. "query" : "this AND that OR thus"
  168. }
  169. }
  170. }
  171. --------------------------------------------------
  172. // CONSOLE
  173. Another option is to provide the wildcard fields search in the query
  174. string itself (properly escaping the `*` sign), for example:
  175. `city.\*:something`:
  176. [source,js]
  177. --------------------------------------------------
  178. GET /_search
  179. {
  180. "query": {
  181. "query_string" : {
  182. "query" : "city.\\*:(this AND that OR thus)"
  183. }
  184. }
  185. }
  186. --------------------------------------------------
  187. // CONSOLE
  188. NOTE: Since `\` (backslash) is a special character in json strings, it needs to
  189. be escaped, hence the two backslashes in the above `query_string`.
  190. When running the `query_string` query against multiple fields, the
  191. following additional parameters are allowed:
  192. [cols="<,<",options="header",]
  193. |=======================================================================
  194. |Parameter |Description
  195. |`type` |How the fields should be combined to build the text query.
  196. See <<multi-match-types, types>> for a complete example.
  197. Defaults to `best_fields`
  198. |=======================================================================
  199. [cols="<,<",options="header",]
  200. |=======================================================================
  201. |Parameter |Description
  202. |`tie_breaker` |The disjunction max tie breaker for multi fields.
  203. Defaults to `0`
  204. |=======================================================================
  205. The fields parameter can also include pattern based field names,
  206. allowing to automatically expand to the relevant fields (dynamically
  207. introduced fields included). For example:
  208. [source,js]
  209. --------------------------------------------------
  210. GET /_search
  211. {
  212. "query": {
  213. "query_string" : {
  214. "fields" : ["content", "name.*^5"],
  215. "query" : "this AND that OR thus"
  216. }
  217. }
  218. }
  219. --------------------------------------------------
  220. // CONSOLE
  221. include::query-string-syntax.asciidoc[]