Edit an inference instance
Last updated
Was this helpful?
Last updated
Was this helpful?
Even after launching, it is possible to update the settings of your inference instances. This is particularly relevant if you need to edit your autoscalling limits, according to the usage of your endpoint.
From Inference, select "Edit". Then, fill up the form with the new parameters to want to pass into your instance.
Here are the element that can be customized from an Running inference instance:
GPU/CPU Flavor
Region
Startup Command
Containers and autoscaling triggers
Environment variables