فهرست منبع

[ML] Add missing params to start deployment api REST spec (#86621)

Updates the REST spec for the start deployment api.
In particular, adds params: `number_of_allocations`, `threads_per_allocation`,
and `queue_capacity`.
Dimitris Athanasiou 3 سال پیش
والد
کامیت
d212cd38ea
1فایلهای تغییر یافته به همراه18 افزوده شده و 0 حذف شده
  1. 18 0
      rest-api-spec/src/main/resources/rest-api-spec/api/ml.start_trained_model_deployment.json

+ 18 - 0
rest-api-spec/src/main/resources/rest-api-spec/api/ml.start_trained_model_deployment.json

@@ -28,6 +28,24 @@
       ]
     },
     "params":{
+      "number_of_allocations":{
+        "type":"int",
+        "description": "The number of model allocations on each node where the model is deployed.",
+        "required": false,
+        "default": 1
+      },
+      "threads_per_allocation":{
+        "type":"int",
+        "description": "The number of threads used by each model allocation during inference.",
+        "required": false,
+        "default": 1
+      },
+      "queue_capacity":{
+        "type":"int",
+        "description": "Controls how many inference requests are allowed in the queue at a time.",
+        "required": false,
+        "default": 1024
+      },
       "timeout":{
         "type":"time",
         "required":false,