| 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475 | [role="xpack"][[ecommerce-transforms]]= Tutorial: Transforming the eCommerce sample data<<transforms,{transforms-cap}>> enable you to retrieve informationfrom an {es} index, transform it, and store it in another index. Let's use the{kibana-ref}/add-sample-data.html[{kib} sample data] to demonstrate how you canpivot and summarize your data with {transforms}.. Verify that your environment is set up properly to use {transforms}. If the{es} {security-features} are enabled, to complete this tutorial you need a userthat has authority to preview and create {transforms}. You must also havespecific index privileges for the source and destination indices. See<<transform-setup>>.. Choose your _source index_.+--In this example, we'll use the eCommerce orders sample data. If you're notalready familiar with the `kibana_sample_data_ecommerce` index, use the*Revenue* dashboard in {kib} to explore the data. Consider what insights youmight want to derive from this eCommerce data.--. Choose the pivot type of {transform} and play with various options forgrouping and aggregating the data.+--There are two types of {transforms}, but first we'll try out _pivoting_ yourdata, which involves using at least one field to group it and applying at leastone aggregation. You can preview what the transformed data will looklike, so go ahead and play with it! You can also enable histogram charts to geta better understanding of the distribution of values in your data.For example, you might want to group the data by product ID and calculate thetotal number of sales for each product and its average price. Alternatively, youmight want to look at the behavior of individual customers and calculate howmuch each customer spent in total and how many different categories of productsthey purchased. Or you might want to take the currencies or geographies intoconsideration. What are the most interesting ways you can transform andinterpret this data?Go to *Management* > *Stack Management* > *Data* > *Transforms* in {kib} and usethe wizard to create a {transform}:[role="screenshot"]image::images/ecommerce-pivot1.png["Creating a simple {transform} in {kib}"]Group the data by customer ID and add one or more aggregations to learn moreabout each customer's orders. For example, let's calculate the sum of productsthey purchased, the total price of their purchases, the maximum number ofproducts that they purchased in a single order, and their total number of orders. We'll accomplish this by using the<<search-aggregations-metrics-sum-aggregation,`sum` aggregation>> on the`total_quantity` and `taxless_total_price` fields, the<<search-aggregations-metrics-max-aggregation,`max` aggregation>> on the`total_quantity` field, and the<<search-aggregations-metrics-cardinality-aggregation,`cardinality` aggregation>>on the `order_id` field:[role="screenshot"]image::images/ecommerce-pivot2.png["Adding multiple aggregations to a {transform} in {kib}"]TIP: If you're interested in a subset of the data, you can optionally include a<<request-body-search-query,query>> element. In thisexample, we've filtered the data so that we're only looking at orders with a`currency` of `EUR`. Alternatively, we could group the data by that field too.If you want to use more complex queries, you can create your {dataframe} from a{kibana-ref}/save-open-search.html[saved search].If you prefer, you can use the<<preview-transform,preview {transforms} API>>..API example[%collapsible]====[source,console]--------------------------------------------------POST _transform/_preview{  "source": {    "index": "kibana_sample_data_ecommerce",    "query": {      "bool": {        "filter": {          "term": {"currency": "EUR"}        }      }    }  },  "pivot": {    "group_by": {      "customer_id": {        "terms": {          "field": "customer_id"        }      }    },    "aggregations": {      "total_quantity.sum": {        "sum": {          "field": "total_quantity"        }      },      "taxless_total_price.sum": {        "sum": {          "field": "taxless_total_price"        }      },      "total_quantity.max": {        "max": {          "field": "total_quantity"        }      },      "order_id.cardinality": {        "cardinality": {          "field": "order_id"        }      }    }  }}--------------------------------------------------// TEST[skip:set up sample data]====--. When you are satisfied with what you see in the preview, create the{transform}.+--.. Supply a {transform} ID, the name of the destination index and optionally adescription. If the destination index does not exist, it will be createdautomatically when you start the {transform}... Decide whether you want the {transform} to run once or continuously. Since this sample data index is unchanging, let's use the default behavior and just run the {transform} once. If you want to try it out, however, go ahead and click on *Continuous mode*. You must choose a field that the {transform} can use to check which entities have changed. In general, it's a good idea to use the ingest timestamp field. In this example, however, you can use the `order_date` field... Optionally, you can configure a retention policy that applies to your {transform}. Select a date field that is used to identify old documents in the destination index and provide a maximum age. Documents that are older than the configured value are removed from the destination index.[role="screenshot"]image::images/ecommerce-pivot3.png["Adding transfrom ID and retention policy to a {transform} in {kib}"]In {kib}, before you finish creating the {transform}, you can copy the preview {transform} API request to your clipboard. This information is useful later when you're deciding whether you want to manually create the destination index.[role="screenshot"]image::images/ecommerce-pivot4.png["Copy the Dev Console statement of the transform preview to the clipboard"]If you prefer, you can use the<<put-transform,create {transforms} API>>..API example[%collapsible]====[source,console]--------------------------------------------------PUT _transform/ecommerce-customer-transform{  "source": {    "index": [      "kibana_sample_data_ecommerce"    ],    "query": {      "bool": {        "filter": {          "term": {            "currency": "EUR"          }        }      }    }  },  "pivot": {    "group_by": {      "customer_id": {        "terms": {          "field": "customer_id"        }      }    },    "aggregations": {      "total_quantity.sum": {        "sum": {          "field": "total_quantity"        }      },      "taxless_total_price.sum": {        "sum": {          "field": "taxless_total_price"        }      },      "total_quantity.max": {        "max": {          "field": "total_quantity"        }      },      "order_id.cardinality": {        "cardinality": {          "field": "order_id"        }      }    }  },  "dest": {    "index": "ecommerce-customers"  },  "retention_policy": {    "time": {      "field": "order_date",      "max_age": "60d"    }  }  }--------------------------------------------------// TEST[skip:setup kibana sample data]====--. Optional: Create the destination index.+--If the destination index does not exist, it is created the first time you startyour {transform}. A pivot transform deduces the mappings for the destinationindex from the source indices and the transform aggregations. If there arefields in the destination index that are derived from scripts (for example, if you use<<search-aggregations-metrics-scripted-metric-aggregation,`scripted_metrics`>>or <<search-aggregations-pipeline-bucket-script-aggregation,`bucket_scripts`>>aggregations), they're created with <<dynamic-mapping,dynamic mappings>>. Youcan use the preview {transform} API to preview the mappings it will use for thedestination index. In {kib}, if you copied the API request to your clipboard, paste it into the console, then refer to the `generated_dest_index` object in the API response.NOTE: {transforms-cap} might have more configuration options provided by the APIs than the options available in {kib}. For example, you can set an ingest pipeline for `dest` by calling the <<put-transform>>. For all the {transform} configuration options, refer to the <<transform-apis,documentation>>..API example[%collapsible]====[source,console-result]--------------------------------------------------{  "preview" : [    {      "total_quantity" : {        "max" : 2,        "sum" : 118.0      },      "taxless_total_price" : {        "sum" : 3946.9765625      },      "customer_id" : "10",      "order_id" : {        "cardinality" : 59      }    },    ...  ],  "generated_dest_index" : {    "mappings" : {      "_meta" : {        "_transform" : {          "transform" : "transform-preview",          "version" : {            "created" : "8.0.0"          },          "creation_date_in_millis" : 1621991264061        },        "created_by" : "transform"      },      "properties" : {        "total_quantity.sum" : {          "type" : "double"        },        "total_quantity" : {          "type" : "object"        },        "taxless_total_price" : {          "type" : "object"        },        "taxless_total_price.sum" : {          "type" : "double"        },        "order_id.cardinality" : {          "type" : "long"        },        "customer_id" : {          "type" : "keyword"        },        "total_quantity.max" : {          "type" : "integer"        },        "order_id" : {          "type" : "object"        }      }    },    "settings" : {      "index" : {        "number_of_shards" : "1",        "auto_expand_replicas" : "0-1"      }    },    "aliases" : { }  }}--------------------------------------------------// TESTRESPONSE[skip:needs sample data]====In some instances the deduced mappings might be incompatible with the actualdata. For example, numeric overflows might occur or dynamically mapped fieldsmight contain both numbers and strings. To avoid this problem, create yourdestination index before you start the {transform}. For more information, seethe <<indices-create-index,create index API>>..API example[%collapsible]====You can use the information from the {transform} preview to create thedestination index. For example:[source,console]--------------------------------------------------PUT /ecommerce-customers{  "mappings": {    "properties": {      "total_quantity.sum" : {        "type" : "double"      },      "total_quantity" : {        "type" : "object"      },      "taxless_total_price" : {        "type" : "object"      },      "taxless_total_price.sum" : {        "type" : "double"      },      "order_id.cardinality" : {        "type" : "long"      },      "customer_id" : {        "type" : "keyword"      },      "total_quantity.max" : {        "type" : "integer"      },      "order_id" : {        "type" : "object"      }    }  }}--------------------------------------------------// TEST====--. Start the {transform}.+--TIP: Even though resource utilization is automatically adjusted based on thecluster load, a {transform} increases search and indexing load on yourcluster while it runs. If you're experiencing an excessive load, however, youcan stop it.You can start, stop, reset, and manage {transforms} in {kib}:[role="screenshot"]image::images/manage-transforms.png["Managing {transforms} in {kib}"]Alternatively, you can use the<<start-transform,start {transforms}>>, <<stop-transform,stop {transforms}>> and <<reset-transform, reset {transforms}>> APIs.If you reset a {transform}, all checkpoints, states, and the destination index (if it was created by the {transform}) are deleted. The {transform} is ready to start again as if it had just been created..API example[%collapsible]====[source,console]--------------------------------------------------POST _transform/ecommerce-customer-transform/_start--------------------------------------------------// TEST[skip:setup kibana sample data]====TIP: If you chose a batch {transform}, it is a single operation that has asingle checkpoint. You cannot restart it when it's complete. {ctransforms-cap}differ in that they continually increment and process checkpoints as new sourcedata is ingested.--. Explore the data in your new index.+--For example, use the *Discover* application in {kib}:[role="screenshot"]image::images/ecommerce-results.png["Exploring the new index in {kib}"]--. Optional: Create another {transform}, this time using the `latest` method.+--This method populates the destination index with the latest documents for eachunique key value. For example, you might want to find the latest orders (sortedby the `order_date` field) for each customer or for each country and region.[role="screenshot"]image::images/ecommerce-latest1.png["Creating a latest {transform} in {kib}"].API example[%collapsible]====[source,console]--------------------------------------------------POST _transform/_preview{  "source": {    "index": "kibana_sample_data_ecommerce",    "query": {      "bool": {        "filter": {          "term": {"currency": "EUR"}        }      }    }  },  "latest": {    "unique_key": ["geoip.country_iso_code", "geoip.region_name"],    "sort": "order_date"  }}--------------------------------------------------// TEST[skip:set up sample data]====TIP: If the destination index does not exist, it is created the first time you start your {transform}. Unlike pivot {transforms}, however, latest {transforms} do not deduce mapping definitions when they create the index. Instead, they use dynamic mappings. To use explicit mappings, create the destination index before you start the {transform}.--. If you do not want to keep a {transform}, you can delete it in{kib} or use the <<delete-transform,delete {transform} API>>. By default, whenyou delete a {transform}, its destination index and {kib} index patterns remain.Now that you've created simple {transforms} for {kib} sample data, considerpossible use cases for your own data. For more ideas, see<<transform-usage>> and <<transform-examples>>.
 |