Edit an inference instance

How to edit an inference instance?

Even after launching, it is possible to update the settings of your inference instances. This is particularly relevant if you need to edit your autoscalling limits, according to the usage of your endpoint.

From Inference, select "Edit". Then, fill up the form with the new parameters to want to pass into your instance.

What can I edit from a running inference instance?

Here are the element that can be customized from an Running inference instance:

  • GPU/CPU Flavor

  • Region

  • Startup Command

  • Containers and autoscaling triggers

  • Environment variables

Last updated

Was this helpful?