Inference Instance configuration

Last updated 2 months ago

Select your model

How to select a public model?

Model catalog

If you want to run inference with one of the best-known models, you can select it from our catalog of preloaded models.

Public custom model

If you want to use a custom model that is publicly hosted (on Docker Hub, for example), click "New Deployment" and enter your Docker image tag in the "Public Model" text field.
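As an illustration of what a Docker image tag looks like, the sketch below splits a reference such as `myorg/my-model:1.0` into its repository and tag parts. The helper name `split_image_reference` and the example image names are hypothetical and not part of Sesterce Cloud. The sketch does not handle digest references (`@sha256:...`), and it assumes Docker's usual behavior of falling back to the `latest` tag when none is given.

```python
def split_image_reference(reference: str) -> tuple[str, str]:
    """Split 'repository[:tag]' into (repository, tag).

    Hypothetical helper for illustration only; digests are not handled.
    """
    # Only a colon after the last slash separates the tag. An earlier colon
    # belongs to a registry host port, as in "localhost:5000/my-model".
    last_segment_start = reference.rfind("/") + 1
    name, separator, tag = reference[last_segment_start:].partition(":")
    repository = reference[:last_segment_start] + name
    # Docker falls back to the "latest" tag when none is specified.
    return repository, (tag if separator else "latest")


if __name__ == "__main__":
    print(split_image_reference("myorg/my-model:1.0"))
    print(split_image_reference("localhost:5000/my-model"))
```

Pinning an explicit tag rather than relying on `latest` generally makes deployments easier to reproduce.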

How to use a private custom model?

If you want to deploy a private custom model, you'll need to create a registry. Click "Private Model" and add your registry information.

To create a registry, perform the following steps:

1. Registry name
   Give your registry a name consisting of lowercase Latin characters, which can be separated by dashes.

2. Location link
   Provide the link to the location where your AI model is stored. We’ll use this URL to retrieve the model during deployment.

3. Username
   Specify the username you use to access the storage location of your AI model.

4. Password
   Enter the password required to access the model.
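The four fields above can be checked locally before you fill in the form. The following is a minimal sketch, not Sesterce Cloud's actual validation: the `RegistryForm` class, the `validate` function, and the example values are all hypothetical. It assumes that a valid name contains only the letters a to z with single dashes between groups, and that the location link is a full URL including a scheme such as `https://`.

```python
import re
from dataclasses import dataclass
from urllib.parse import urlparse

# Assumption: lowercase a-z groups joined by single dashes, e.g. "my-registry".
NAME_PATTERN = re.compile(r"[a-z]+(?:-[a-z]+)*")


@dataclass
class RegistryForm:
    """Hypothetical container mirroring the four registry fields."""
    name: str
    location: str
    username: str
    password: str


def validate(form: RegistryForm) -> list[str]:
    """Return a list of problems; an empty list means no problem was found."""
    problems = []
    if not NAME_PATTERN.fullmatch(form.name):
        problems.append("name: use lowercase Latin letters separated by dashes")
    parsed = urlparse(form.location)
    # Assumption: the link must carry both a scheme and a host.
    if not (parsed.scheme and parsed.netloc):
        problems.append("location: expected a full URL such as https://host/path")
    if not form.username:
        problems.append("username: must not be empty")
    if not form.password:
        problems.append("password: must not be empty")
    return problems


if __name__ == "__main__":
    form = RegistryForm(
        name="my-registry",
        location="https://registry.example.com/team/model",
        username="alice",
        password="example-password",
    )
    print(validate(form))
```

Returning every problem at once, rather than stopping at the first, lets you correct all fields in a single pass.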
