categorize.md 646 B

% This is generated by ESQL's AbstractFunctionTestCase. Do not edit it. See ../README.md for how to regenerate it.

Supported function named parameters

output_format : (keyword) The output format of the categories. Defaults to regex.

similarity_threshold : (integer) The minimum percentage of token weight that must match for text to be added to the category bucket. Must be between 1 and 100. The larger the value the narrower the categories. Larger values will increase memory usage and create narrower categories. Defaults to 70.

analyzer : (keyword) Analyzer used to convert the field into tokens for text categorization.