[[index-modules-store]]
== Store

The store module allows you to control how index data is stored and accessed on disk.

[float]
[[file-system]]
=== File system storage types

There are different file system implementations or _storage types_. By default,
Elasticsearch will pick the best implementation based on the operating
environment.

This can be overridden for all indices by adding this to the
`config/elasticsearch.yml` file:

[source,yaml]
---------------------------------
index.store.type: niofs
---------------------------------

It is a _static_ setting that can be set on a per-index basis at index
creation time:

[source,js]
---------------------------------
PUT /my_index
{
  "settings": {
    "index.store.type": "niofs"
  }
}
---------------------------------
// CONSOLE

WARNING: This is an expert-only setting and may be removed in the future.

The following sections list all the supported storage types.

`fs`::

Default file system implementation. This will pick the best implementation
depending on the operating environment, which is currently `hybridfs` on all
supported systems but is subject to change.

[[simplefs]]`simplefs`::

The Simple FS type is a straightforward implementation of file system
storage (maps to Lucene `SimpleFsDirectory`) using a random access file.
This implementation has poor concurrent performance (multiple threads
will bottleneck). It is usually better to use `niofs` when you need
index persistence.

[[niofs]]`niofs`::

The NIO FS type stores the shard index on the file system (maps to
Lucene `NIOFSDirectory`) using NIO. It allows multiple threads to read
from the same file concurrently. It is not recommended on Windows
because of a bug in the SUN Java implementation.

[[mmapfs]]`mmapfs`::

The MMap FS type stores the shard index on the file system (maps to
Lucene `MMapDirectory`) by mapping a file into memory (mmap). Memory
mapping uses up a portion of the virtual memory address space in your
process equal to the size of the file being mapped. Before using this
store type, be sure you have allowed plenty of
<<vm-max-map-count,virtual address space>>.

[[hybridfs]]`hybridfs`::

The `hybridfs` type is a hybrid of `niofs` and `mmapfs`, which chooses the best
file system type for each type of file based on the read access pattern.
Currently only the Lucene term dictionary, norms and doc values files are
memory mapped. All other files are opened using Lucene `NIOFSDirectory`.
As with `mmapfs`, be sure you have allowed plenty of
<<vm-max-map-count,virtual address space>>.

[[allow-mmap]]
You can restrict the use of the `mmapfs` and the related `hybridfs` store type
via the setting `node.store.allow_mmap`. This is a boolean setting indicating
whether or not memory-mapping is allowed. The default is to allow it. This
setting is useful, for example, if you are in an environment where you cannot
control the ability to create a lot of memory maps and therefore need to
disable memory-mapping.

=== Pre-loading data into the file system cache

NOTE: This is an expert setting, the details of which may change in the future.

By default, Elasticsearch completely relies on the operating system file system
cache for caching I/O operations. It is possible to set `index.store.preload`
in order to tell the operating system to load the content of hot index
files into memory upon opening. This setting accepts a comma-separated list of
file extensions: all files whose extension is in the list will be pre-loaded
upon opening. This can be useful to improve search performance of an index,
especially when the host operating system is restarted, since this causes the
file system cache to be trashed. Note, however, that this may slow down the
opening of indices, as they will only become available after data have been
loaded into physical memory.

This setting is best-effort only and may not work at all depending on the store
type and host operating system.

`index.store.preload` is a static setting that can be set either in
`config/elasticsearch.yml`:

[source,yaml]
---------------------------------
index.store.preload: ["nvd", "dvd"]
---------------------------------

or in the index settings at index creation time:

[source,js]
---------------------------------
PUT /my_index
{
  "settings": {
    "index.store.preload": ["nvd", "dvd"]
  }
}
---------------------------------
// CONSOLE

The default value is the empty array, which means that nothing will be loaded
into the file-system cache eagerly. For indices that are actively searched,
you might want to set it to `["nvd", "dvd"]`, which will cause norms and doc
values to be loaded eagerly into physical memory. These are the first two
extensions to look at since Elasticsearch performs random access on them.

A wildcard can be used in order to indicate that all files should be preloaded:
`index.store.preload: ["*"]`. Note however that it is generally not useful to
load all files into memory, in particular those for stored fields and term
vectors, so a better option might be to set it to
`["nvd", "dvd", "tim", "doc", "dim"]`, which will preload norms, doc values,
term dictionaries, postings lists and points, which are the most important
parts of the index for search and aggregations.

Note that this setting can be dangerous on indices that are larger than the size
of the main memory of the host, as it would cause the filesystem cache to be
trashed upon reopens after large merges, which would make indexing and searching
_slower_.
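
As an illustrative sketch only, the broader extension list discussed above
could be applied when an index is created. The index name `my_index` is the
same placeholder used in the earlier examples, and whether this list suits a
given index depends on its size relative to the host's main memory:

[source,js]
---------------------------------
PUT /my_index
{
  "settings": {
    "index.store.preload": ["nvd", "dvd", "tim", "doc", "dim"]
  }
}
---------------------------------
// CONSOLE

Because `index.store.preload` is a static setting, a request like this only
makes sense at index creation time.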