GPU Cloud instances
The following endpoints let you create and manage GPU Cloud instances through the API.
Get the list of available offers
Use this endpoint to explore the GPU instance options available for your project. It is particularly useful when planning a new deployment and comparing offers to find the best fit in terms of cost and performance.
Send your API key secret in the x-api-key header to authenticate the request.
GET /gpu-cloud/instances/offers HTTP/1.1
Host:
x-api-key: text
Accept: */*
[
  {
    "gpuName": "A6000",
    "gpuCount": 8,
    "nvlink": true,
    "deploymentType": "vm",
    "instanceId": "A6000x8",
    "cloudInitAvailable": true,
    "cloud": {
      "_id": "668f9d8df664e3c028ce5de9",
      "name": "AZ_01"
    },
    "configuration": {
      "ramGB": 100,
      "storageGB": 1000,
      "vCpu": 100,
      "vRamGB": 100,
      "os": [
        "ubuntu22.04_cuda12.2_shade_os"
      ],
      "interconnect": "pcie"
    },
    "hourlyPrice": 10,
    "availability": [
      {
        "region": "us-central-2",
        "name": "US, Central",
        "countryCode": "US",
        "available": true
      }
    ]
  }
]
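As a rough illustration of comparing offers, the sketch below filters a parsed response for offers that have at least one available region and picks the one with the lowest hourlyPrice. It runs on a trimmed copy of the sample response rather than a live call. The helper names available_regions and cheapest_available are hypothetical, and whether hourlyPrice is billed per instance or per GPU is not stated here, so treat the price comparison as an assumption.

```python
import json

# Trimmed copy of the sample response for GET /gpu-cloud/instances/offers.
SAMPLE_OFFERS = json.loads("""
[
  {
    "gpuName": "A6000",
    "gpuCount": 8,
    "instanceId": "A6000x8",
    "hourlyPrice": 10,
    "configuration": {"os": ["ubuntu22.04_cuda12.2_shade_os"]},
    "availability": [
      {"region": "us-central-2", "name": "US, Central",
       "countryCode": "US", "available": true}
    ]
  }
]
""")


def available_regions(offer):
    """Return the region ids where the offer is marked as available."""
    return [a["region"] for a in offer.get("availability", []) if a.get("available")]


def cheapest_available(offers, gpu_name=None):
    """Return the lowest-priced offer with an available region, or None."""
    candidates = [
        o for o in offers
        if available_regions(o) and (gpu_name is None or o.get("gpuName") == gpu_name)
    ]
    return min(candidates, key=lambda o: o["hourlyPrice"], default=None)


best = cheapest_available(SAMPLE_OFFERS, gpu_name="A6000")
if best is not None:
    print(best["instanceId"], best["hourlyPrice"], available_regions(best))
```

The instanceId and region of the selected offer are the kind of values the create request expects, so keeping the whole offer object around is usually more convenient than extracting the price alone.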
Create a GPU Cloud instance
Use this endpoint when you are ready to launch a new GPU instance for a specific project or task. This is the step that actually deploys new computing resources.
Send your API key secret in the x-api-key header to authenticate the request.
POST /gpu-cloud/instances HTTP/1.1
Host:
x-api-key: text
Content-Type: application/json
Accept: */*
Content-Length: 409
{
  "name": "my-machine-name",
  "cloudProvider": "668f9d8df664e3c028ce5de9",
  "instanceId": "A6000",
  "volumeId": "668f9d8df664e3c028ce5de9",
  "region": "us-east-1",
  "vm": {
    "os": "ubuntu22.04_cuda12.2_shade_os",
    "base64CloudInitScript": "IyEvYmluL2Jhc2gKCiMgRW5kbGVzcyBsb29wCndoaWxlIHRydWUKZG8KICAgICMgRmV0Y2ggYSBjYXQgZmFjdCB3aXRoIG..."
  },
  "dockerContainer": {
    "image": "vllm/vllm-openai:latest"
  },
  "sshKeyId": "668f9d8df664e3c028ce5de9"
}
{
  "_id": "668f9d8df664e3c028ce5de9",
  "name": "my-machine-name",
  "provider": "AZ_01",
  "region": {
    "name": "USA-Central-1",
    "countryCode": "US"
  },
  "volumes": [
    "668f9d8df664e3c028ce5de9"
  ],
  "status": "active",
  "gpuCount": 1,
  "gpuModel": "A100",
  "ram": 100,
  "storage": 100,
  "vramPerGpu": 100,
  "vcpus": 1,
  "interconnect": "VM",
  "nvlink": true,
  "os": "Ubuntu 20.04",
  "ip": "192.168.1.1",
  "sshUser": "sesterce",
  "sshPort": 22,
  "dockerImage": "ubuntu/20.04",
  "dockerCommand": "docker run -it ubuntu/20.04",
  "hourlyPrice": 100,
  "deletedAt": "2021-01-01T00:00:00.000Z",
  "createdAt": "2021-01-01T00:00:00.000Z",
  "updatedAt": "2021-01-01T00:00:00.000Z"
}
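The request above could be assembled in Python roughly as follows. This is a minimal sketch under several assumptions: BASE_URL is a placeholder because the Host header is left blank in this page, build_create_request is a hypothetical helper, and the payload covers only a subset of the fields from the sample body, since which fields are required is not stated here. The field name base64CloudInitScript suggests the cloud-init script is sent base64-encoded, so the sketch encodes it that way. The request is built but not sent.

```python
import base64
import json
import urllib.request

# Placeholder: the real host is not given in this page.
BASE_URL = "https://api.example.invalid"


def build_create_request(api_key, name, cloud_provider, instance_id, region,
                         os_name, ssh_key_id, cloud_init_script=None):
    """Build (but do not send) a POST /gpu-cloud/instances request."""
    vm = {"os": os_name}
    if cloud_init_script is not None:
        # Assumption: the script is base64-encoded UTF-8 text.
        vm["base64CloudInitScript"] = base64.b64encode(
            cloud_init_script.encode("utf-8")
        ).decode("ascii")
    payload = {
        "name": name,
        "cloudProvider": cloud_provider,
        "instanceId": instance_id,
        "region": region,
        "vm": vm,
        "sshKeyId": ssh_key_id,
    }
    return urllib.request.Request(
        BASE_URL + "/gpu-cloud/instances",
        data=json.dumps(payload).encode("utf-8"),
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


req = build_create_request(
    api_key="YOUR_API_KEY",
    name="my-machine-name",
    cloud_provider="668f9d8df664e3c028ce5de9",
    instance_id="A6000",
    region="us-east-1",
    os_name="ubuntu22.04_cuda12.2_shade_os",
    ssh_key_id="668f9d8df664e3c028ce5de9",
    cloud_init_script="#!/bin/bash\necho hello\n",
)
print(req.get_method(), req.full_url)
# To send it once BASE_URL points at the real host:
# with urllib.request.urlopen(req) as resp:
#     instance = json.load(resp)
```

Building the request separately from sending it makes it easy to inspect the JSON body and headers before any resources are deployed.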
Make sure you provide a valid OS name, such as ubuntu24.04_cuda12.4_shade_os. You can retrieve the available OS names with the /gpu-cloud/instances/offers endpoint.
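One way to catch a wrong OS name before sending the create request is to compare it with the os list returned for the chosen offer. The sketch below assumes the offers response has already been parsed into Python objects and that the instanceId of the offer is the value you pass when creating the instance; offered_os_names and check_os are hypothetical helper names.

```python
# Trimmed sample of a parsed GET /gpu-cloud/instances/offers response.
OFFERS = [
    {
        "instanceId": "A6000x8",
        "configuration": {"os": ["ubuntu22.04_cuda12.2_shade_os"]},
    }
]


def offered_os_names(offers, instance_id):
    """Collect the OS names listed under configuration.os for one instanceId."""
    names = []
    for offer in offers:
        if offer.get("instanceId") == instance_id:
            names.extend(offer.get("configuration", {}).get("os", []))
    return names


def check_os(offers, instance_id, os_name):
    """Return os_name if the offer lists it, otherwise raise ValueError."""
    allowed = offered_os_names(offers, instance_id)
    if os_name not in allowed:
        raise ValueError(
            f"{os_name!r} is not offered for {instance_id!r}; choose one of {allowed}"
        )
    return os_name


print(check_os(OFFERS, "A6000x8", "ubuntu22.04_cuda12.2_shade_os"))
```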
Get the list of instances created
Use this endpoint to get an overview of all the instances you have created. It is well suited to managing and monitoring your current resources.
Send your API key secret in the x-api-key header to authenticate the request.
GET /gpu-cloud/instances HTTP/1.1
Host:
x-api-key: text
Accept: */*
[
  {
    "_id": "668f9d8df664e3c028ce5de9",
    "name": "my-machine-name",
    "provider": "AZ_01",
    "region": {
      "name": "USA-Central-1",
      "countryCode": "US"
    },
    "volumes": [
      "668f9d8df664e3c028ce5de9"
    ],
    "status": "active",
    "gpuCount": 1,
    "gpuModel": "A100",
    "ram": 100,
    "storage": 100,
    "vramPerGpu": 100,
    "vcpus": 1,
    "interconnect": "VM",
    "nvlink": true,
    "os": "Ubuntu 20.04",
    "ip": "192.168.1.1",
    "sshUser": "sesterce",
    "sshPort": 22,
    "dockerImage": "ubuntu/20.04",
    "dockerCommand": "docker run -it ubuntu/20.04",
    "hourlyPrice": 100,
    "deletedAt": "2021-01-01T00:00:00.000Z",
    "createdAt": "2021-01-01T00:00:00.000Z",
    "updatedAt": "2021-01-01T00:00:00.000Z"
  }
]
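For monitoring purposes, a parsed list response can be summarized in a few lines. The sketch below groups instances by their status field and sums hourlyPrice per group. It uses a trimmed copy of the sample response; the helper name hourly_cost_by_status is hypothetical, and "active" is the only status value shown in this page, so any other status names you encounter are outside what this sketch assumes.

```python
from collections import defaultdict

# Trimmed sample of a parsed GET /gpu-cloud/instances response.
INSTANCES = [
    {
        "_id": "668f9d8df664e3c028ce5de9",
        "name": "my-machine-name",
        "status": "active",
        "gpuCount": 1,
        "gpuModel": "A100",
        "hourlyPrice": 100,
    }
]


def hourly_cost_by_status(instances):
    """Sum hourlyPrice per status value; a missing status is grouped as 'unknown'."""
    totals = defaultdict(float)
    for inst in instances:
        totals[inst.get("status", "unknown")] += inst.get("hourlyPrice", 0)
    return dict(totals)


for status, cost in hourly_cost_by_status(INSTANCES).items():
    print(f"{status}: {cost:.2f} per hour")
```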
Get details about a GPU Cloud instance created
Use this endpoint to check the details and status of a specific instance, for example when troubleshooting or verifying its configuration.
Send your API key secret in the x-api-key header to authenticate the request.
GET /gpu-cloud/instances/{id} HTTP/1.1
Host:
x-api-key: text
Accept: */*
{
  "_id": "668f9d8df664e3c028ce5de9",
  "name": "my-machine-name",
  "provider": "AZ_01",
  "status": "active",
  "region": {
    "name": "USA-Central-1",
    "countryCode": "US",
    "id": "us-central-1"
  },
  "volumes": [
    "668f9d8df664e3c028ce5de9"
  ],
  "sshKey": {
    "_id": "668f9d8df664e3c028ce5de9",
    "name": "my-ssh-key",
    "publicKey": "ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQDA",
    "createdAt": "2021-01-01T00:00:00.000Z",
    "updatedAt": "2021-01-01T00:00:00.000Z"
  },
  "createdAt": "2021-01-01T00:00:00.000Z",
  "deletedAt": "2021-01-01T00:00:00.000Z",
  "isPending": true,
  "hourlyPrice": 0.15,
  "ip": "127.0.0.1",
  "sshUser": "sesterce",
  "sshPort": 22,
  "portForwards": [
    {
      "internalPort": 80,
      "externalPort": 8080
    }
  ],
  "gpuCount": 1,
  "gpuModel": "A100",
  "ram": 100,
  "storage": 100,
  "vramPerGpu": 100,
  "vcpus": 1,
  "interconnect": "pcie",
  "nvlink": true,
  "os": "ubuntu24.04",
  "dockerImage": "ubuntu/20.04",
  "dockerCommand": "docker run -it ubuntu/20.04"
}
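A common use of the details response is to wait for a new instance and then connect to it. The sketch below polls a caller-supplied function until the response no longer reports isPending, then builds an ssh command line from ip, sshUser and sshPort. It assumes that isPending turns false once the instance is ready, which this page does not spell out. The names wait_until_ready and ssh_command are hypothetical, and fetch_details stands in for whatever code performs GET /gpu-cloud/instances/{id}.

```python
import shlex
import time


def wait_until_ready(fetch_details, max_attempts=30, poll_seconds=10.0, sleep=time.sleep):
    """Call fetch_details() until the result no longer reports isPending."""
    for attempt in range(max_attempts):
        details = fetch_details()
        if not details.get("isPending", False):
            return details
        if attempt < max_attempts - 1:
            sleep(poll_seconds)
    raise TimeoutError("instance is still pending after polling")


def ssh_command(details):
    """Build an ssh command line from the ip, sshUser and sshPort fields."""
    target = f'{details["sshUser"]}@{details["ip"]}'
    return shlex.join(["ssh", "-p", str(details["sshPort"]), target])


# Stand-in for real API calls: first response is pending, second is ready.
fake_responses = iter([
    {"isPending": True},
    {"isPending": False, "ip": "127.0.0.1", "sshUser": "sesterce", "sshPort": 22},
])
ready = wait_until_ready(lambda: next(fake_responses), sleep=lambda seconds: None)
print(ssh_command(ready))
```

Passing the fetch function and the sleep function as arguments keeps the polling logic independent of the HTTP client and makes it testable without network access.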
Delete a GPU Cloud instance
Use this endpoint to delete an instance you no longer need, freeing up resources and reducing costs.
Send your API key secret in the x-api-key header to authenticate the request.
DELETE /gpu-cloud/instances/{id} HTTP/1.1
Host:
x-api-key: text
Accept: */*
No content
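The delete call could be prepared in Python along these lines. This is a sketch only: BASE_URL is a placeholder because the Host header is left blank in this page, and build_delete_request is a hypothetical helper. The request is built but not sent, since deleting an instance is presumably irreversible.

```python
import urllib.parse
import urllib.request

# Placeholder: the real host is not given in this page.
BASE_URL = "https://api.example.invalid"


def build_delete_request(api_key, instance_id):
    """Build (but do not send) a DELETE /gpu-cloud/instances/{id} request."""
    # Percent-encode the id so it always stays a single path segment.
    path = "/gpu-cloud/instances/" + urllib.parse.quote(instance_id, safe="")
    return urllib.request.Request(
        BASE_URL + path,
        headers={"x-api-key": api_key},
        method="DELETE",
    )


req = build_delete_request("YOUR_API_KEY", "668f9d8df664e3c028ce5de9")
print(req.get_method(), req.full_url)
# To send it once BASE_URL points at the real host:
# with urllib.request.urlopen(req) as resp:
#     print(resp.status)
```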