
[DOCS] Adds top_metrics aggs examples to Transform docs (#72738)

Co-authored-by: Lisa Cawley <lcawley@elastic.co>
István Zoltán Szabó 4 years ago
Parent
Current commit
bbfe962cae
2 files changed, 220 insertions and 14 deletions
  1. docs/reference/transform/examples.asciidoc (+218 −11)
  2. docs/reference/transform/painless-examples.asciidoc (+2 −3)

+ 218 - 11
docs/reference/transform/examples.asciidoc

@@ -11,11 +11,12 @@ from your data. All the examples use one of the
 {kibana-ref}/add-sample-data.html[{kib} sample datasets]. For a more detailed, 
 step-by-step example, see <<ecommerce-transforms>>.
 
-* <<example-best-customers>>
-* <<example-airline>>
-* <<example-clientips>>
-* <<example-last-log>>
-
+* <<example-best-customers>> 
+* <<example-airline>> 
+* <<example-clientips>> 
+* <<example-last-log>> 
+* <<example-bytes>> 
+* <<example-customer-names>>

 [[example-best-customers]]
 == Finding your best customers
@@ -344,18 +345,21 @@ This {transform} makes it easier to answer questions such as:

 This example uses the web log sample data set to find the last log from an IP 
 address. Let's use the `latest` type of {transform} in continuous mode. It 
-copies the most recent document for each unique key from the source index to the destination index
-and updates the destination index as new data comes into the source index. 
+copies the most recent document for each unique key from the source index to the 
+destination index and updates the destination index as new data comes into the 
+source index. 

 Pick the `clientip` field as the unique key; the data is grouped by this field. 
 Select `timestamp` as the date field that sorts the data chronologically. For 
 continuous mode, specify a date field that is used to identify new documents, 
 and an interval between checks for changes in the source index.

- Let's assume that we're interested in retaining documents only for IP addresses that appeared recently in the log. You can define a retention policy and specify a date field that is used to calculate 
-the age of a document. This example uses the same date field that is used to 
-sort the data. Then set the maximum age of a document; documents that are older 
-than the value you set will be removed from the destination index.
+Let's assume that we're interested in retaining documents only for IP addresses 
+that appeared recently in the log. You can define a retention policy and specify 
+a date field that is used to calculate the age of a document. This example uses 
+the same date field that is used to sort the data. Then set the maximum age of a 
+document; documents that are older than the value you set will be removed from 
+the destination index.

 This {transform} creates the destination index that contains the latest login 
 date for each client IP. As the {transform} runs in continuous mode, the 
@@ -483,3 +487,206 @@ The search result shows you data like this for each client IP:
 This {transform} makes it easier to answer questions such as:

 * What was the most recent log event associated with a specific IP address?
+
+
+[[example-bytes]]
+== Finding client IPs that sent the most bytes to the server
+
+This example uses the web log sample data set to find the client IP that sent 
+the most bytes to the server in every hour. The example uses a `pivot` 
+{transform} with a <<search-aggregations-metrics-top-metrics,`top_metrics`>> 
+aggregation.
+
+Group the data by a <<_date_histogram,date histogram>> on the time field with an 
+interval of one hour. Use a 
+<<search-aggregations-metrics-max-aggregation,max aggregation>> on the `bytes` 
+field to get the maximum amount of data that is sent to the server. Without 
+the `max` aggregation, the API call still returns the client IP that sent the 
+most bytes; however, the number of bytes it sent is not returned. In the 
+`top_metrics` property, specify `clientip` and `geo.src`, then sort them by the 
+`bytes` field in descending order. The {transform} returns the client IP that 
+sent the biggest amount of data and the 2-letter ISO code of the corresponding 
+location.
+
+[source,console]
+----------------------------------
+POST _transform/_preview
+{
+  "source": {
+    "index": "kibana_sample_data_logs"
+  },
+  "pivot": {
+    "group_by": { <1>
+      "timestamp": {
+        "date_histogram": {
+          "field": "timestamp",
+          "fixed_interval": "1h"
+        }
+      }
+    },
+    "aggregations": {
+      "bytes.max": { <2>
+        "max": {
+          "field": "bytes"
+        }
+      },
+      "top": {
+        "top_metrics": { <3>
+          "metrics": [
+            {
+              "field": "clientip"
+            },
+            {
+              "field": "geo.src"
+            }
+          ],
+          "sort": {
+            "bytes": "desc"
+          }
+        }
+      }
+    }
+  }
+}
+----------------------------------
+// TEST[skip:setup kibana sample data]
+
+<1> The data is grouped by a date histogram of the time field with a one hour 
+interval.
+<2> Calculates the maximum value of the `bytes` field. 
+<3> Specifies the fields (`clientip` and `geo.src`) of the top document to 
+return and the sorting method (document with the highest `bytes` value).
+
+The API call above returns a response similar to this:
+
+[source,js]
+----------------------------------
+{
+  "preview" : [
+    {
+      "top" : {
+        "clientip" : "223.87.60.27",
+        "geo.src" : "IN"
+      },
+      "bytes" : {
+        "max" : 6219
+      },
+      "timestamp" : "2021-04-25T00:00:00.000Z"
+    },
+    {
+      "top" : {
+        "clientip" : "99.74.118.237",
+        "geo.src" : "LK"
+      },
+      "bytes" : {
+        "max" : 14113
+      },
+      "timestamp" : "2021-04-25T03:00:00.000Z"
+    },
+    {
+      "top" : {
+        "clientip" : "218.148.135.12",
+        "geo.src" : "BR"
+      },
+      "bytes" : {
+        "max" : 4531
+      },
+      "timestamp" : "2021-04-25T04:00:00.000Z"
+    },
+    ...
+  ]
+}
+----------------------------------
+// NOTCONSOLE
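
The pivot above can also be reasoned about locally. The following is a minimal
Python sketch (not part of the commit) of the same semantics — an hourly date
histogram, a `max` on `bytes`, and a `top_metrics` sorted by `bytes` descending —
run over a few invented documents standing in for `kibana_sample_data_logs`:

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical sample documents (made up for illustration).
docs = [
    {"timestamp": "2021-04-25T00:12:00", "clientip": "223.87.60.27", "geo_src": "IN", "bytes": 6219},
    {"timestamp": "2021-04-25T00:40:00", "clientip": "10.0.0.1", "geo_src": "US", "bytes": 1024},
    {"timestamp": "2021-04-25T03:05:00", "clientip": "99.74.118.237", "geo_src": "LK", "bytes": 14113},
]

# Date histogram with a one-hour fixed interval: truncate timestamps to the hour.
buckets = defaultdict(list)
for doc in docs:
    hour = datetime.fromisoformat(doc["timestamp"]).replace(minute=0, second=0)
    buckets[hour].append(doc)

# Per bucket: the max of `bytes`, plus the clientip/geo.src of the document
# with the highest `bytes` value (top_metrics sorted by bytes, descending).
preview = []
for hour, bucket in sorted(buckets.items()):
    top = max(bucket, key=lambda d: d["bytes"])
    preview.append({
        "timestamp": hour.isoformat(),
        "bytes.max": max(d["bytes"] for d in bucket),
        "top": {"clientip": top["clientip"], "geo.src": top["geo_src"]},
    })
```

Each entry of `preview` then carries the same three pieces of information as a
row of the transform's destination index: the bucket timestamp, the maximum
bytes, and the top document's fields.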
+
+[[example-customer-names]]
+== Getting customer name and email address by customer ID
+
+This example uses the ecommerce sample data set to create an entity-centric 
+index based on customer ID, and to get the customer name and email address by 
+using the `top_metrics` aggregation.
+
+Group the data by `customer_id`, then add a `top_metrics` aggregation where the 
+`metrics` are the `email`, the `customer_first_name.keyword`, and the 
+`customer_last_name.keyword` fields. Sort the `top_metrics` by `order_date` in 
+descending order. The API call looks like this:
+
+[source,console]
+----------------------------------
+POST _transform/_preview 
+{
+  "source": {
+    "index": "kibana_sample_data_ecommerce"
+  },
+  "pivot": {
+    "group_by": { <1>
+      "customer_id": {
+        "terms": {
+          "field": "customer_id"
+        }
+      }
+    },
+    "aggregations": {
+      "last": {
+        "top_metrics": { <2>
+          "metrics": [
+            {
+              "field": "email"
+            },
+            {
+              "field": "customer_first_name.keyword"
+            },
+            {
+              "field": "customer_last_name.keyword"
+            }
+          ],
+          "sort": {
+            "order_date": "desc"
+          }
+        }
+      }
+    }
+  }
+}
+----------------------------------
+// TEST[skip:setup kibana sample data]
+
+<1> The data is grouped by a `terms` aggregation on the `customer_id` field.
+<2> Specifies the fields to return (email and name fields), sorted in 
+descending order by the order date.
+
+The API returns a response that is similar to this:
+
+[source,js]
+----------------------------------
+{
+  "preview" : [
+    {
+      "last" : {
+        "customer_last_name.keyword" : "Long",
+        "customer_first_name.keyword" : "Recip",
+        "email" : "recip@long-family.zzz"
+      },
+      "customer_id" : "10"
+    },
+    {
+      "last" : {
+        "customer_last_name.keyword" : "Jackson",
+        "customer_first_name.keyword" : "Fitzgerald",
+        "email" : "fitzgerald@jackson-family.zzz"
+      },
+      "customer_id" : "11"
+    },
+    {
+      "last" : {
+        "customer_last_name.keyword" : "Cross",
+        "customer_first_name.keyword" : "Brigitte",
+        "email" : "brigitte@cross-family.zzz"
+      },
+      "customer_id" : "12"
+    },
+    ...
+  ]
+}
+----------------------------------
+// NOTCONSOLE
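
In the same spirit, the `terms` + `top_metrics` pivot in this second example
amounts to "keep selected fields from the newest order per customer". A short
Python sketch (again outside the commit, with invented orders standing in for
`kibana_sample_data_ecommerce`):

```python
# Hypothetical order documents (made up for illustration).
orders = [
    {"customer_id": "10", "order_date": "2021-04-20",
     "email": "recip@long-family.zzz", "first": "Recip", "last": "Long"},
    {"customer_id": "10", "order_date": "2021-04-25",
     "email": "recip@long-family.zzz", "first": "Recip", "last": "Long"},
    {"customer_id": "11", "order_date": "2021-04-22",
     "email": "fitzgerald@jackson-family.zzz", "first": "Fitzgerald", "last": "Jackson"},
]

# terms group_by on customer_id + top_metrics sorted by order_date desc:
# for each customer, keep the metric fields of the most recent order.
latest = {}
for order in orders:
    cid = order["customer_id"]
    if cid not in latest or order["order_date"] > latest[cid]["order_date"]:
        latest[cid] = order

preview = [
    {"customer_id": cid,
     "last": {"email": o["email"],
              "customer_first_name.keyword": o["first"],
              "customer_last_name.keyword": o["last"]}}
    for cid, o in sorted(latest.items())
]
```

The resulting `preview` list mirrors the shape of the `_transform/_preview`
response above: one entity-centric entry per `customer_id`, carrying the
fields of that customer's latest order.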

+ 2 - 3
docs/reference/transform/painless-examples.asciidoc

@@ -77,9 +77,8 @@ returned by each shard and returns the document with the latest timestamp
 (`last_doc`). In the response, the top hit (in other words, the `latest_doc`) is 
 nested below the `latest_doc` field.

-Check the
-<<scripted-metric-aggregation-scope,scope of scripts>>
-for detailed explanation on the respective scripts.
+Check the <<scripted-metric-aggregation-scope,scope of scripts>> for detailed 
+explanation on the respective scripts.

 You can retrieve the last value in a similar way: 