Expose AI model from Hugging Face using vLLM
You want to expose to the world an API endpoint to allow your end-users to access an AI Model from Hugging Face? This tutorial is for you!
Choose your VM offer

Set-up your instance

Connect to your instance


Use your model

Common errors
Instance RAM insufficient
Container not running yet
Port used or unavailable
Last updated