| 1234567891011121314151617181920212223242526272829303132333435363738394041424344 | [role="xpack"][[ml-functions]]= Function referenceThe {ml-features} include analysis functions that provide a wide variety offlexible ways to analyze data for anomalies.When you create {anomaly-jobs}, you specify one or more detectors, which definethe type of analysis that needs to be done. If you are creating your job byusing {ml} APIs, you specify the functions in detector configuration objects.If you are creating your job in {kib}, you specify the functions differentlydepending on whether you are creating single metric, multi-metric, or advancedjobs.//For a demonstration of creating jobs in {kib}, see <<ml-getting-started>>.Most functions detect anomalies in both low and high values. In statisticalterminology, they apply a two-sided test. Some functions offer low and highvariations (for example, `count`, `low_count`, and `high_count`). These variationsapply one-sided tests, detecting anomalies only when the values are low orhigh, depending one which alternative is used.You can specify a `summary_count_field_name` with any function except `metric`.When you use `summary_count_field_name`, the {ml} features expect the inputdata to be pre-aggregated. The value of the `summary_count_field_name` fieldmust contain the count of raw events that were summarized. In {kib}, use the**summary_count_field_name** in advanced {anomaly-jobs}. Analyzing aggregatedinput data provides a significant boost in performance. For more information, see<<ml-configuring-aggregation>>.If your data is sparse, there may be gaps in the data which means you might haveempty buckets. You might want to treat these as anomalies or you might want thesegaps to be ignored. Your decision depends on your use case and what is importantto you. It also depends on which functions you use. The `sum` and `count`functions are strongly affected by empty buckets. For this reason, there are`non_null_sum` and `non_zero_count` functions, which are tolerant to sparse data.These functions effectively ignore empty buckets.* <<ml-count-functions,Count functions>>* <<ml-geo-functions,Geographic functions>>* <<ml-info-functions,Information content functions>>* <<ml-metric-functions,Metric functions>>* <<ml-rare-functions,Rare functions>>* <<ml-sum-functions,Sum functions>>* <<ml-time-functions,Time functions>>
 |