Edit an inference instance

How to edit an inference instance?

Even after launching, it is possible to update the settings of your inference instances. This is particularly relevant if you need to edit your autoscalling limits, according to the usage of your endpoint.

From Inference, select "Edit". Then, fill up the form with the new parameters to want to pass into your instance.

What can I edit from a running inference instance?

Here are the element that can be customized from an Running inference instance:

GPU/CPU Flavor
Region
Startup Command
Containers and autoscaling triggers
Environment variables

PreviousAutoscaling limits NextChat with Endpoint

Last updated 5 months ago

Was this helpful?