Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Path Parameters
ID of the endpoint group to update
Example:
4242
Body
application/json
Minimum floor load in perf units/s (token/s for LLMs)
Example:
0
Target capacity utilization (fraction, max 1.0)
Example:
0.9
Cold/stopped instance capacity target as multiple of hot capacity target
Example:
2.5
Min number of workers to keep 'cold' when you have no load
Example:
5
Max number of workers your endpoint group can have
Example:
20
Deployment endpoint name
Example:
"my_endpoint"
Response
Success response
Example:
true