docker.asciidoc 13 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297
  1. [[docker]]
  2. === Install Elasticsearch with Docker
  3. Elasticsearch is also available as a Docker image.
  4. The image is built with {xpack-ref}/index.html[X-Pack] and uses https://hub.docker.com/_/centos/[centos:7] as the base image.
  5. The source code can be found on https://github.com/elastic/elasticsearch-docker/tree/{branch}[GitHub].
  6. ==== Security note
  7. NOTE: {xpack-ref}/index.html[X-Pack] is preinstalled in this image.
  8. Please take a few minutes to familiarize yourself with {xpack-ref}/security-getting-started.html[X-Pack Security] and how to change default passwords. The default password for the `elastic` user is `changeme`.
  9. NOTE: X-Pack includes a trial license for 30 days. After that, you can obtain one of the https://www.elastic.co/subscriptions[available subscriptions] or {ref}/security-settings.html[disable Security]. The Basic license is free and includes the https://www.elastic.co/products/x-pack/monitoring[Monitoring] extension.
  10. Obtaining Elasticsearch for Docker is as simple as issuing a +docker pull+ command against the Elastic Docker registry.
  11. ifeval::["{release-state}"=="unreleased"]
  12. WARNING: Version {version} of Elasticsearch has not yet been released, so no Docker image is currently available for this version.
  13. endif::[]
  14. ifeval::["{release-state}"!="unreleased"]
  15. The Docker image can be retrieved with the following command:
  16. ["source","sh",subs="attributes"]
  17. --------------------------------------------
  18. docker pull {docker-image}
  19. --------------------------------------------
  20. endif::[]
  21. [[docker-cli-run]]
  22. ==== Running Elasticsearch from the command line
  23. [[docker-cli-run-dev-mode]]
  24. ===== Development mode
  25. ifeval::["{release-state}"=="unreleased"]
  26. WARNING: Version {version} of the Elasticsearch Docker image has not yet been released.
  27. endif::[]
  28. ifeval::["{release-state}"!="unreleased"]
  29. Elasticsearch can be quickly started for development or testing use with the following command:
  30. ["source","sh",subs="attributes"]
  31. --------------------------------------------
  32. docker run -p 9200:9200 -e "http.host=0.0.0.0" -e "transport.host=127.0.0.1" {docker-image}
  33. --------------------------------------------
  34. endif::[]
  35. [[docker-cli-run-prod-mode]]
  36. ===== Production mode
  37. [[docker-prod-prerequisites]]
  38. [IMPORTANT]
  39. =========================
  40. The `vm_max_map_count` kernel setting needs to be set to at least `262144` for production use.
  41. Depending on your platform:
  42. * Linux
  43. +
  44. The `vm_map_max_count` setting should be set permanently in /etc/sysctl.conf:
  45. +
  46. [source,sh]
  47. --------------------------------------------
  48. $ grep vm.max_map_count /etc/sysctl.conf
  49. vm.max_map_count=262144
  50. ----------------------------------
  51. +
  52. To apply the setting on a live system type: `sysctl -w vm.max_map_count=262144`
  53. +
  54. * OSX with https://docs.docker.com/engine/installation/mac/#/docker-for-mac[Docker for Mac]
  55. +
  56. The `vm_max_map_count` setting must be set within the xhyve virtual machine:
  57. +
  58. ["source","sh"]
  59. --------------------------------------------
  60. $ screen ~/Library/Containers/com.docker.docker/Data/com.docker.driver.amd64-linux/tty
  61. --------------------------------------------
  62. +
  63. Log in with 'root' and no password.
  64. Then configure the `sysctl` setting as you would for Linux:
  65. +
  66. ["source","sh"]
  67. --------------------------------------------
  68. sysctl -w vm.max_map_count=262144
  69. --------------------------------------------
  70. +
  71. * OSX with https://docs.docker.com/engine/installation/mac/#docker-toolbox[Docker Toolbox]
  72. +
  73. The `vm_max_map_count` setting must be set via docker-machine:
  74. +
  75. ["source","sh"]
  76. --------------------------------------------
  77. docker-machine ssh
  78. sudo sysctl -w vm.max_map_count=262144
  79. --------------------------------------------
  80. =========================
  81. The following example brings up a cluster comprising two Elasticsearch nodes.
  82. To bring up the cluster, use the <<docker-prod-cluster-composefile,`docker-compose.yml`>> and just type:
  83. ifeval::["{release-state}"=="unreleased"]
  84. WARNING: Version {version} of the Elasticsearch Docker image has not yet been released, so a `docker-compose.yml` is not available for this version.
  85. endif::[]
  86. ifeval::["{release-state}"!="unreleased"]
  87. ["source","sh"]
  88. --------------------------------------------
  89. docker-compose up
  90. --------------------------------------------
  91. endif::[]
  92. [NOTE]
  93. `docker-compose` is not pre-installed with Docker on Linux.
  94. Instructions for installing it can be found on the https://docs.docker.com/compose/install/#install-using-pip[docker-compose webpage].
  95. The node `elasticsearch1` listens on `localhost:9200` while `elasticsearch2` talks to `elasticsearch1` over a Docker network.
  96. This example also uses https://docs.docker.com/engine/tutorials/dockervolumes[Docker named volumes], called `esdata1` and `esdata2` which will be created if not already present.
  97. [[docker-prod-cluster-composefile]]
  98. `docker-compose.yml`:
  99. ifeval::["{release-state}"=="unreleased"]
  100. WARNING: Version {version} of the Elasticsearch Docker image has not yet been released, so a `docker-compose.yml` is not available for this version.
  101. endif::[]
  102. ifeval::["{release-state}"!="unreleased"]
  103. ["source","yaml",subs="attributes"]
  104. --------------------------------------------
  105. version: '2'
  106. services:
  107. elasticsearch1:
  108. image: docker.elastic.co/elasticsearch/elasticsearch:{version}
  109. container_name: elasticsearch1
  110. environment:
  111. - cluster.name=docker-cluster
  112. - bootstrap.memory_lock=true
  113. - "ES_JAVA_OPTS=-Xms512m -Xmx512m"
  114. ulimits:
  115. memlock:
  116. soft: -1
  117. hard: -1
  118. mem_limit: 1g
  119. volumes:
  120. - esdata1:/usr/share/elasticsearch/data
  121. ports:
  122. - 9200:9200
  123. networks:
  124. - esnet
  125. elasticsearch2:
  126. image: docker.elastic.co/elasticsearch/elasticsearch:{version}
  127. environment:
  128. - cluster.name=docker-cluster
  129. - bootstrap.memory_lock=true
  130. - "ES_JAVA_OPTS=-Xms512m -Xmx512m"
  131. - "discovery.zen.ping.unicast.hosts=elasticsearch1"
  132. ulimits:
  133. memlock:
  134. soft: -1
  135. hard: -1
  136. mem_limit: 1g
  137. volumes:
  138. - esdata2:/usr/share/elasticsearch/data
  139. networks:
  140. - esnet
  141. volumes:
  142. esdata1:
  143. driver: local
  144. esdata2:
  145. driver: local
  146. networks:
  147. esnet:
  148. --------------------------------------------
  149. endif::[]
  150. To stop the cluster, type `docker-compose down`. Data volumes will persist, so it's possible to start the cluster again with the same data using `docker-compose up`.
  151. To destroy the cluster **and the data volumes** just type `docker-compose down -v`.
  152. ===== Inspect status of cluster:
  153. ["source","sh"]
  154. --------------------------------------------
  155. curl -u elastic http://127.0.0.1:9200/_cat/health
  156. Enter host password for user 'elastic':
  157. 1472225929 15:38:49 docker-cluster green 2 2 4 2 0 0 0 0 - 100.0%
  158. --------------------------------------------
  159. // NOTCONSOLE
  160. // This is demonstrating curl. Console will prompt you for a username and
  161. // password so no need to demonstrate that. Converting this would not show the
  162. // important `-u elastic` parameters for `curl`.
  163. Log messages go to the console and are handled by the configured Docker logging driver. By default you can access logs with `docker logs`.
  164. [[docker-configuration-methods]]
  165. ==== Configuring Elasticsearch with Docker
  166. Elasticsearch loads its configuration from files under `/usr/share/elasticsearch/config/`. These configuration files are documented in <<settings>> and <<jvm-options>>.
  167. The image offers several methods for configuring Elasticsearch settings with the conventional approach being to provide customized files, i.e. `elasticsearch.yml`, but it's also possible to use environment variables to set options:
  168. ===== A. Present the parameters via Docker environment variables
  169. For example, to define the cluster name with `docker run` you can pass `-e "cluster.name=mynewclustername"`. Double quotes are required.
  170. ===== B. Bind-mounted configuration
  171. Create your custom config file and mount this over the image's corresponding file.
  172. For example, bind-mounting a `custom_elasticsearch.yml` with `docker run` can be accomplished with the parameter:
  173. ["source","sh"]
  174. --------------------------------------------
  175. -v full_path_to/custom_elasticsearch.yml:/usr/share/elasticsearch/config/elasticsearch.yml
  176. --------------------------------------------
  177. IMPORTANT: The container **runs Elasticsearch as user `elasticsearch` using uid:gid `1000:1000`**. Bind mounted host directories and files, such as `custom_elasticsearch.yml` above, **need to be accessible by this user**. For the https://www.elastic.co/guide/en/elasticsearch/reference/current/important-settings.html#path-settings[data and log dirs], such as `/usr/share/elasticsearch/data`, write access is required as well.
  178. ===== C. Customized image
  179. In some environments, it may make more sense to prepare a custom image containing your configuration. A `Dockerfile` to achieve this may be as simple as:
  180. ["source","sh",subs="attributes"]
  181. --------------------------------------------
  182. FROM docker.elastic.co/elasticsearch/elasticsearch:{version}
  183. ADD elasticsearch.yml /usr/share/elasticsearch/config/
  184. USER root
  185. RUN chown elasticsearch:elasticsearch config/elasticsearch.yml
  186. USER elasticsearch
  187. --------------------------------------------
  188. You could then build and try the image with something like:
  189. ["source","sh"]
  190. --------------------------------------------
  191. docker build --tag=elasticsearch-custom .
  192. docker run -ti -v /usr/share/elasticsearch/data elasticsearch-custom
  193. --------------------------------------------
  194. ===== D. Override the image's default https://docs.docker.com/engine/reference/run/#cmd-default-command-or-options[CMD]
  195. Options can be passed as command-line options to the Elasticsearch process by
  196. overriding the default command for the image. For example:
  197. ["source","sh"]
  198. --------------------------------------------
  199. docker run <various parameters> bin/elasticsearch -Ecluster.name=mynewclustername
  200. --------------------------------------------
  201. ==== Notes for production use and defaults
  202. We have collected a number of best practices for production use.
  203. NOTE: Any Docker parameters mentioned below assume the use of `docker run`.
  204. . Elasticsearch runs inside the container as user `elasticsearch` using uid:gid `1000:1000`. If you are bind-mounting a local directory or file, ensure it is readable by this user, while the <<path-settings,data and log dirs>> additionally require write access.
  205. +
  206. . It is important to ensure increased ulimits for <<setting-system-settings,nofile>> and <<max-number-threads-check,nproc>> are available for the Elasticsearch containers. Verify the https://github.com/moby/moby/tree/ea4d1243953e6b652082305a9c3cda8656edab26/contrib/init[init system] for the Docker daemon is already setting those to acceptable values and, if needed, adjust them in the Daemon, or override them per container, for example using `docker run`:
  207. +
  208. --ulimit nofile=65536:65536
  209. +
  210. NOTE: One way of checking the Docker daemon defaults for the aforementioned ulimits is by running:
  211. +
  212. docker run --rm centos:7 /bin/bash -c 'ulimit -Hn && ulimit -Sn && ulimit -Hu && ulimit -Su'
  213. +
  214. . Swapping needs to be disabled for performance and node stability. This can be achieved through any of the methods mentioned in the <<setup-configuration-memory,Elasticsearch docs>>. If you opt for the `bootstrap.memory_lock: true` approach, apart from defining it through any of the <<docker-configuration-methods,configuration methods>>, you will additionally need the `memlock: true` ulimit, either defined in the https://docs.docker.com/engine/reference/commandline/dockerd/#default-ulimits[Docker Daemon] or specifically set for the container. This has been demonstrated earlier in the <<docker-prod-cluster-composefile,docker-compose.yml>>, or using `docker run`:
  215. +
  216. -e "bootstrap_memory_lock=true" --ulimit memlock=-1:-1
  217. +
  218. . The image https://docs.docker.com/engine/reference/builder/#/expose[exposes] TCP ports 9200 and 9300. For clusters it is recommended to randomize the published ports with `--publish-all`, unless you are pinning one container per host.
  219. +
  220. . Use the `ES_JAVA_OPTS` environment variable to set heap size, e.g. to use 16GB use `-e ES_JAVA_OPTS="-Xms16g -Xmx16g"` with `docker run`. It is also recommended to set a https://docs.docker.com/engine/reference/run/#user-memory-constraints[memory limit] for the container.
  221. +
  222. . Pin your deployments to a specific version of the Elasticsearch Docker image, e.g. +docker.elastic.co/elasticsearch/elasticsearch:{version}+.
  223. +
  224. . Always use a volume bound on `/usr/share/elasticsearch/data`, as shown in the <<docker-cli-run-prod-mode,production example>>, for the following reasons:
  225. +
  226. .. The data of your elasticsearch node won't be lost if the container is killed
  227. .. Elasticsearch is I/O sensitive and the Docker storage driver is not ideal for fast I/O
  228. .. It allows the use of advanced https://docs.docker.com/engine/extend/plugins/#volume-plugins[Docker volume plugins]
  229. +
  230. . If you are using the devicemapper storage driver (default on at least RedHat (rpm) based distributions) make sure you are not using the default `loop-lvm` mode. Configure docker-engine to use https://docs.docker.com/engine/userguide/storagedriver/device-mapper-driver/#configure-docker-with-devicemapper[direct-lvm] instead.
  231. +
  232. . Consider centralizing your logs by using a different https://docs.docker.com/engine/admin/logging/overview/[logging driver]. Also note that the default json-file logging driver is not ideally suited for production use.
  233. include::next-steps.asciidoc[]