migrate_3_0.asciidoc 14 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354
  1. [[breaking-changes-3.0]]
  2. == Breaking changes in 3.0
  3. This section discusses the changes that you need to be aware of when migrating
  4. your application to Elasticsearch 3.0.
  5. === Search changes
  6. ==== `search_type=count` removed
  7. The `count` search type was deprecated since version 2.0.0 and is now removed.
  8. In order to get the same benefits, you just need to set the value of the `size`
  9. parameter to `0`.
  10. For instance, the following request:
  11. [source,sh]
  12. ---------------
  13. GET /my_index/_search?search_type=count
  14. {
  15. "aggs": {
  16. "my_terms": {
  17. "terms": {
  18. "field": "foo"
  19. }
  20. }
  21. }
  22. }
  23. ---------------
  24. can be replaced with:
  25. [source,sh]
  26. ---------------
  27. GET /my_index/_search
  28. {
  29. "size": 0,
  30. "aggs": {
  31. "my_terms": {
  32. "terms": {
  33. "field": "foo"
  34. }
  35. }
  36. }
  37. }
  38. ---------------
  39. ==== `search_type=scan` removed
  40. The `scan` search type was deprecated since version 2.1.0 and is now removed.
  41. All benefits from this search type can now be achieved by doing a scroll
  42. request that sorts documents in `_doc` order, for instance:
  43. [source,sh]
  44. ---------------
  45. GET /my_index/_search?scroll=2m
  46. {
  47. "sort": [
  48. "_doc"
  49. ]
  50. }
  51. ---------------
  52. Scroll requests sorted by `_doc` have been optimized to more efficiently resume
  53. from where the previous request stopped, so this will have the same performance
  54. characteristics as the former `scan` search type.
  55. === REST API changes
  56. ==== search exists api removed
  57. The search exists api has been removed in favour of using the search api with
  58. `size` set to `0` and `terminate_after` set to `1`.
  59. ==== `/_optimize` endpoint removed
  60. The deprecated `/_optimize` endpoint has been removed. The `/_forcemerge`
  61. endpoint should be used in lieu of optimize.
  62. === Parent/Child changes
  63. The `children` aggregation, parent child inner hits and `has_child` and `has_parent` queries will not work on indices
  64. with `_parent` field mapping created before version `2.0.0`. The data of these indices need to be re-indexed into a new index.
  65. The format of the join between parent and child documents have changed with the `2.0.0` release. The old
  66. format can't read from version `3.0.0` and onwards. The new format allows for a much more efficient and
  67. scalable join between parent and child documents and the join data structures are stored on on disk
  68. data structures as opposed as before the join data structures were stored in the jvm heap space.
  69. ==== Deprecated queries removed
  70. The following deprecated queries have been removed:
  71. * `filtered`: use `bool` query instead, which supports `filter` clauses too
  72. * `and`: use `must` clauses in a `bool` query instead
  73. * `or`: use should clauses in a `bool` query instead
  74. * `limit`: use `terminate_after` parameter instead
  75. * `fquery`: obsolete after filters and queries have been merged
  76. * `query`: obsolete after filters and queries have been merged
  77. ==== `score_type` has been removed
  78. The `score_type` option has been removed from the `has_child` and `has_parent` queries in favour of the `score_mode` option
  79. which does the exact same thing.
  80. ==== `sum` score mode removed
  81. The `sum` score mode has been removed in favour of the `total` mode which doesn the same and is already available in
  82. previous versions.
  83. ==== `max_children` option
  84. When `max_children` was set to `0` on the `has_child` query then there was no upper limit on how many children documents
  85. are allowed to match. This has changed and `0` now really means to zero child documents are allowed. If no upper limit
  86. is needed then the `max_children` option shouldn't be defined at all on the `has_child` query.
  87. === Settings changes ===
  88. ==== Analysis settings
  89. The `index.analysis.analyzer.default_index` analyzer is not supported anymore.
  90. If you wish to change the analyzer to use for indexing, change the
  91. `index.analysis.analyzer.default` analyzer instead.
  92. ==== Ping timeout settings
  93. Previously, there were three settings for the ping timeout: `discovery.zen.initial_ping_timeout`,
  94. `discovery.zen.ping.timeout` and `discovery.zen.ping_timeout`. The former two have been removed and
  95. the only setting key for the ping timeout is now `discovery.zen.ping_timeout`. The default value for
  96. ping timeouts remains at three seconds.
  97. === Plugins
  98. Plugins implementing custom queries need to implement the `fromXContent(QueryParseContext)` method in their
  99. `QueryParser` subclass rather than `parse`. This method will take care of parsing the query from `XContent` format
  100. into an intermediate query representation that can be streamed between the nodes in binary format, effectively the
  101. query object used in the java api. Also, the query parser needs to implement the `getBuilderPrototype` method that
  102. returns a prototype of the `NamedWriteable` query, which allows to deserialize an incoming query by calling
  103. `readFrom(StreamInput)` against it, which will create a new object, see usages of `Writeable`. The `QueryParser`
  104. also needs to declare the generic type of the query that it supports and it's able to parse.
  105. The query object can then transform itself into a lucene query through the new `toQuery(QueryShardContext)` method,
  106. which returns a lucene query to be executed on the data node.
  107. Similarly, plugins implementing custom score functions need to implement the `fromXContent(QueryParseContext)`
  108. method in their `ScoreFunctionParser` subclass rather than `parse`. This method will take care of parsing
  109. the function from `XContent` format into an intermediate function representation that can be streamed between
  110. the nodes in binary format, effectively the function object used in the java api. Also, the query parser needs
  111. to implement the `getBuilderPrototype` method that returns a prototype of the `NamedWriteable` function, which
  112. allows to deserialize an incoming function by calling `readFrom(StreamInput)` against it, which will create a
  113. new object, see usages of `Writeable`. The `ScoreFunctionParser` also needs to declare the generic type of the
  114. function that it supports and it's able to parse. The function object can then transform itself into a lucene
  115. function through the new `toFunction(QueryShardContext)` method, which returns a lucene function to be executed
  116. on the data node.
  117. ==== Cloud AWS plugin
  118. Cloud AWS plugin has been split in two plugins:
  119. * {plugins}/discovery-ec2.html[Discovery EC2 plugin]
  120. * {plugins}/repository-s3.html[Repository S3 plugin]
  121. ==== Cloud Azure plugin
  122. Cloud Azure plugin has been split in three plugins:
  123. * {plugins}/discovery-azure.html[Discovery Azure plugin]
  124. * {plugins}/repository-azure.html[Repository Azure plugin]
  125. * {plugins}/store-smb.html[Store SMB plugin]
  126. ==== Cloud GCE plugin
  127. Cloud GCE plugin has been renamed to {plugins}/discovery-gce.html[Discovery GCE plugin].
  128. === Java-API
  129. ==== Count api has been removed
  130. The deprecated count api has been removed from the Java api, use the search api instead and set size to 0.
  131. The following call
  132. ```
  133. client.prepareCount(indices).setQuery(query).get();
  134. ```
  135. can be replaced with
  136. ```
  137. client.prepareSearch(indices).setSource(new SearchSourceBuilder().size(0).query(query)).get();
  138. ```
  139. ==== BoostingQueryBuilder
  140. Removed setters for mandatory positive/negative query. Both arguments now have
  141. to be supplied at construction time already and have to be non-null.
  142. ==== SpanContainingQueryBuilder
  143. Removed setters for mandatory big/little inner span queries. Both arguments now have
  144. to be supplied at construction time already and have to be non-null. Updated
  145. static factory methods in QueryBuilders accordingly.
  146. ==== SpanOrQueryBuilder
  147. Making sure that query contains at least one clause by making initial clause mandatory
  148. in constructor.
  149. ==== SpanNearQueryBuilder
  150. Removed setter for mandatory slop parameter, needs to be set in constructor now. Also
  151. making sure that query contains at least one clause by making initial clause mandatory
  152. in constructor. Updated the static factory methods in QueryBuilders accordingly.
  153. ==== SpanNotQueryBuilder
  154. Removed setter for mandatory include/exclude span query clause, needs to be set in constructor now.
  155. Updated the static factory methods in QueryBuilders and tests accordingly.
  156. ==== SpanWithinQueryBuilder
  157. Removed setters for mandatory big/little inner span queries. Both arguments now have
  158. to be supplied at construction time already and have to be non-null. Updated
  159. static factory methods in QueryBuilders accordingly.
  160. ==== QueryFilterBuilder
  161. Removed the setter `queryName(String queryName)` since this field is not supported
  162. in this type of query. Use `FQueryFilterBuilder.queryName(String queryName)` instead
  163. when in need to wrap a named query as a filter.
  164. ==== WrapperQueryBuilder
  165. Removed `wrapperQueryBuilder(byte[] source, int offset, int length)`. Instead simply
  166. use `wrapperQueryBuilder(byte[] source)`. Updated the static factory methods in
  167. QueryBuilders accordingly.
  168. ==== QueryStringQueryBuilder
  169. Removed ability to pass in boost value using `field(String field)` method in form e.g. `field^2`.
  170. Use the `field(String, float)` method instead.
  171. ==== Operator
  172. Removed the enums called `Operator` from `MatchQueryBuilder`, `QueryStringQueryBuilder`,
  173. `SimpleQueryStringBuilder`, and `CommonTermsQueryBuilder` in favour of using the enum
  174. defined in `org.elasticsearch.index.query.Operator` in an effort to consolidate the
  175. codebase and avoid duplication.
  176. ==== queryName and boost support
  177. Support for `queryName` and `boost` has been streamlined to all of the queries. That is
  178. a breaking change till queries get sent over the network as serialized json rather
  179. than in `Streamable` format. In fact whenever additional fields are added to the json
  180. representation of the query, older nodes might throw error when they find unknown fields.
  181. ==== InnerHitsBuilder
  182. InnerHitsBuilder now has a dedicated addParentChildInnerHits and addNestedInnerHits methods
  183. to differentiate between inner hits for nested vs. parent / child documents. This change
  184. makes the type / path parameter mandatory.
  185. ==== MatchQueryBuilder
  186. Moving MatchQueryBuilder.Type and MatchQueryBuilder.ZeroTermsQuery enum to MatchQuery.Type.
  187. Also reusing new Operator enum.
  188. ==== MoreLikeThisQueryBuilder
  189. Removed `MoreLikeThisQueryBuilder.Item#id(String id)`, `Item#doc(BytesReference doc)`,
  190. `Item#doc(XContentBuilder doc)`. Use provided constructors instead.
  191. Removed `MoreLikeThisQueryBuilder#addLike` in favor of texts and/or items beeing provided
  192. at construction time. Using arrays there instead of lists now.
  193. Removed `MoreLikeThisQueryBuilder#addUnlike` in favor to using the `unlike` methods
  194. which take arrays as arguments now rather than the lists used before.
  195. The deprecated `docs(Item... docs)`, `ignoreLike(Item... docs)`,
  196. `ignoreLike(String... likeText)`, `addItem(Item... likeItems)` have been removed.
  197. ==== GeoDistanceQueryBuilder
  198. Removing individual setters for lon() and lat() values, both values should be set together
  199. using point(lon, lat).
  200. ==== GeoDistanceRangeQueryBuilder
  201. Removing setters for to(Object ...) and from(Object ...) in favour of the only two allowed input
  202. arguments (String, Number). Removing setter for center point (point(), geohash()) because parameter
  203. is mandatory and should already be set in constructor.
  204. Also removing setters for lt(), lte(), gt(), gte() since they can all be replaced by equivallent
  205. calls to to/from() and inludeLower()/includeUpper().
  206. ==== GeoPolygonQueryBuilder
  207. Require shell of polygon already to be specified in constructor instead of adding it pointwise.
  208. This enables validation, but makes it necessary to remove the addPoint() methods.
  209. ==== MultiMatchQueryBuilder
  210. Moving MultiMatchQueryBuilder.ZeroTermsQuery enum to MatchQuery.ZeroTermsQuery.
  211. Also reusing new Operator enum.
  212. Removed ability to pass in boost value using `field(String field)` method in form e.g. `field^2`.
  213. Use the `field(String, float)` method instead.
  214. ==== MissingQueryBuilder
  215. The two individual setters for existence() and nullValue() were removed in favour of
  216. optional constructor settings in order to better capture and validate their interdependent
  217. settings at construction time.
  218. ==== NotQueryBuilder
  219. The NotQueryBuilder which was deprecated in 2.1.0 is removed. As a replacement use BoolQueryBuilder
  220. with added mustNot() clause. So instead of using `new NotQueryBuilder(filter)` now use
  221. `new BoolQueryBuilder().mustNot(filter)`.
  222. ==== TermsQueryBuilder
  223. Remove the setter for `termsLookup()`, making it only possible to either use a TermsLookup object or
  224. individual values at construction time. Also moving individual settings for the TermsLookup (lookupIndex,
  225. lookupType, lookupId, lookupPath) to the separate TermsLookup class, using constructor only and moving
  226. checks for validation there. Removed `TermsLookupQueryBuilder` in favour of `TermsQueryBuilder`.
  227. ==== FunctionScoreQueryBuilder
  228. `add` methods have been removed, all filters and functions must be provided as constructor arguments by
  229. creating an array of `FunctionScoreQueryBuilder.FilterFunctionBuilder` objects, containing one element
  230. for each filter/function pair.
  231. `scoreMode` and `boostMode` can only be provided using corresponding enum members instead
  232. of string values: see `FilterFunctionScoreQuery.ScoreMode` and `CombineFunction`.
  233. `CombineFunction.MULT` has been renamed to `MULTIPLY`.
  234. ==== IdsQueryBuilder
  235. For simplicity, only one way of adding the ids to the existing list (empty by default) is left: `addIds(String...)`
  236. ==== DocumentAlreadyExistsException removed
  237. `DocumentAlreadyExistsException` is removed and a `VersionConflictException` is thrown instead (with a better
  238. error description). This will influence code that use the `IndexRequest.opType()` or `IndexRequest.create()`
  239. to index a document only if it doesn't already exist.
  240. === Cache concurrency level settings removed
  241. Two cache concurrency level settings `indices.requests.cache.concurrency_level` and
  242. `indices.fielddata.cache.concurrency_level` because they no longer apply to the cache implementation used for the
  243. request cache and the field data cache.
  244. === Remove bind option of `non_loopback`
  245. This setting would arbitrarily pick the first interface not marked as loopback. Instead, specify by address
  246. scope (e.g. `_local_,_site_` for all loopback and private network addresses) or by explicit interface names,
  247. hostnames, or addresses.