GPU Cloud instances
The following endpoints let you create and manage GPU Cloud instances through the API.
Get the list of available offers
Use this endpoint to explore the GPU instance offers available for your project. It is particularly useful when planning a new deployment, because it lets you compare offers on cost and performance.
Send your API key secret in the x-api-key header to authenticate the request.
Returns the list of available offers for instances
API key invalid
Not found
GET /gpu-cloud/instances/offers HTTP/1.1
Host: 
x-api-key: text
Accept: */*
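As a minimal sketch, the request above could be sent with Python's standard library and the result filtered by price. The base URL is a placeholder (the host is not given on this page), and build_offers_request, fetch_offers, and cheapest_available are hypothetical helper names, not part of the API.

```python
import json
import urllib.request

# Assumption: placeholder host; this page does not state the real one.
BASE_URL = "https://api.example.com"


def build_offers_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) the GET request for the offers list."""
    return urllib.request.Request(
        f"{BASE_URL}/gpu-cloud/instances/offers",
        headers={"x-api-key": api_key, "Accept": "application/json"},
        method="GET",
    )


def fetch_offers(api_key: str) -> list:
    """Send the request and decode the JSON array of offers."""
    with urllib.request.urlopen(build_offers_request(api_key)) as resp:
        return json.load(resp)


def cheapest_available(offers: list, gpu_name: str):
    """Return the lowest-priced offer for gpu_name that is available
    in at least one region, or None if there is no such offer."""
    candidates = [
        offer for offer in offers
        if offer.get("gpuName") == gpu_name
        and any(a.get("available") for a in offer.get("availability", []))
    ]
    return min(candidates, key=lambda offer: offer["hourlyPrice"], default=None)
```

For example, cheapest_available(fetch_offers(api_key), "A6000") would return the cheapest A6000 offer that is currently available somewhere, using the gpuName, hourlyPrice, and availability fields shown in the sample response that follows.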
[
  {
    "gpuName": "A6000",
    "gpuCount": 8,
    "nvlink": true,
    "deploymentType": "vm",
    "instanceId": "A6000x8",
    "cloudInitAvailable": true,
    "cloud": {
      "_id": "668f9d8df664e3c028ce5de9",
      "name": "AZ_01"
    },
    "configuration": {
      "ramGB": 100,
      "storageGB": 1000,
      "vCpu": 100,
      "vRamGB": 100,
      "os": [
        "ubuntu22.04_cuda12.2_shade_os"
      ],
      "interconnect": "pcie"
    },
    "hourlyPrice": 10,
    "availability": [
      {
        "region": "us-central-2",
        "name": "US, Central",
        "countryCode": "US",
        "available": true
      }
    ]
  }
]
Create a GPU Cloud instance
Use this endpoint when you are ready to launch a new GPU instance for a specific project or task. It is the step that deploys new computing resources.
Send your API key secret in the x-api-key header to authenticate the request.
Returns the created instance
API key invalid
Not found
POST /gpu-cloud/instances HTTP/1.1
Host: 
x-api-key: text
Content-Type: application/json
Accept: */*
Content-Length: 625
{
  "name": "my-machine-name",
  "cloudProvider": "668f9d8df664e3c028ce5de9",
  "instanceId": "A6000",
  "volumeId": "668f9d8df664e3c028ce5de9",
  "region": "us-east-1",
  "vm": {
    "os": "ubuntu22.04_cuda12.2_shade_os",
    "base64CloudInitScript": "IyEvYmluL2Jhc2gKCiMgRW5kbGVzcyBsb29wCndoaWxlIHRydWUKZG8KICAgICMgRmV0Y2ggYSBjYXQgZmFjdCB3aXRoIG..."
  },
  "dockerContainer": {
    "image": "vllm/vllm-openai:latest"
  },
  "sshKeyId": "668f9d8df664e3c028ce5de9",
  "sshKey": "ssh-rsa AAAAB3NzaC1yc2EADAQABAAAAgQDsl21/gI87CHcHX21sNAwckJNdCXMlOKcQVZBxEsEE9FNpxWu8VIJl8kNBYm0oBiDvTOgtC7ru+mz2MJBsBlnFxT732LUDwf1otPPzQ3jUEtFFfYqWN4lyGJ4tLLTuPmXCrNM+rt00nzlGS0WFvjszjw== [email protected]"
}
{
  "_id": "668f9d8df664e3c028ce5de9",
  "name": "my-machine-name",
  "provider": "AZ_01",
  "region": {
    "name": "USA-Central-1",
    "countryCode": "US"
  },
  "volumes": [
    "668f9d8df664e3c028ce5de9"
  ],
  "status": "active",
  "gpuCount": 1,
  "gpuModel": "A100",
  "ram": 100,
  "storage": 100,
  "vramPerGpu": 100,
  "vcpus": 1,
  "interconnect": "VM",
  "nvlink": true,
  "os": "Ubuntu 20.04",
  "ip": "192.168.1.1",
  "sshUser": "sesterce",
  "sshPort": 22,
  "dockerImage": "ubuntu/20.04",
  "dockerCommand": "docker run -it ubuntu/20.04",
  "hourlyPrice": 100,
  "deletedAt": "2021-01-01T00:00:00.000Z",
  "createdAt": "2021-01-01T00:00:00.000Z",
  "updatedAt": "2021-01-01T00:00:00.000Z"
}
Make sure you specify a valid OS name, such as ubuntu24.04_cuda12.4_shade_os. You can retrieve the available OS names from the /gpu-cloud/instances/offers endpoint.
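The creation request and the OS check could be sketched in Python as follows. This page does not say which body fields are required, so the sketch assumes a VM deployment that references an existing key through sshKeyId and leaves out volumeId, dockerContainer, and sshKey. The base URL is a placeholder, and offered_os_names and build_create_request are hypothetical helper names.

```python
import base64
import json
import urllib.request

# Assumption: placeholder host; this page does not state the real one.
BASE_URL = "https://api.example.com"


def offered_os_names(offers: list) -> set:
    """Collect every OS name listed under configuration.os across offers."""
    names = set()
    for offer in offers:
        names.update(offer.get("configuration", {}).get("os", []))
    return names


def build_create_request(api_key, name, cloud_provider, instance_id, region,
                         os_name, ssh_key_id, cloud_init_script=None):
    """Build (but do not send) the POST request that creates a VM instance."""
    vm = {"os": os_name}
    if cloud_init_script is not None:
        # Assumption: the field name implies a Base64-encoded script.
        vm["base64CloudInitScript"] = base64.b64encode(
            cloud_init_script.encode("utf-8")
        ).decode("ascii")
    payload = {
        "name": name,
        "cloudProvider": cloud_provider,
        "instanceId": instance_id,
        "region": region,
        "vm": vm,
        "sshKeyId": ssh_key_id,
    }
    return urllib.request.Request(
        f"{BASE_URL}/gpu-cloud/instances",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "x-api-key": api_key,
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
        method="POST",
    )
```

Checking os_name against offered_os_names(offers) before sending lets a typo in the OS name fail locally instead of at the API. Passing the built request to urllib.request.urlopen would send it.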
Get the list of instances created
This endpoint returns an overview of all the instances you have created, which makes it useful for managing and monitoring your current resources.
Send your API key secret in the x-api-key header to authenticate the request.
Returns the list of user instances
API key invalid
API key not found
GET /gpu-cloud/instances HTTP/1.1
Host: 
x-api-key: text
Accept: */*
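A rough Python equivalent of the request above, with a small monitoring helper, might look like this. The base URL is a placeholder, the helper names are hypothetical, and the set of possible status values is not documented here, so the helper simply groups by whatever status each instance reports.

```python
import json
import urllib.request

# Assumption: placeholder host; this page does not state the real one.
BASE_URL = "https://api.example.com"


def build_list_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) the GET request for the instance list."""
    return urllib.request.Request(
        f"{BASE_URL}/gpu-cloud/instances",
        headers={"x-api-key": api_key, "Accept": "application/json"},
        method="GET",
    )


def fetch_instances(api_key: str) -> list:
    """Send the request and decode the JSON array of instances."""
    with urllib.request.urlopen(build_list_request(api_key)) as resp:
        return json.load(resp)


def hourly_cost_by_status(instances: list) -> dict:
    """Sum hourlyPrice per status value; instances without a status
    are grouped under "unknown"."""
    totals = {}
    for instance in instances:
        status = instance.get("status", "unknown")
        totals[status] = totals.get(status, 0) + instance.get("hourlyPrice", 0)
    return totals
```

Calling hourly_cost_by_status(fetch_instances(api_key)) gives a quick view of what the current instances cost per hour, based on the status and hourlyPrice fields in the sample response that follows.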
[
  {
    "_id": "668f9d8df664e3c028ce5de9",
    "name": "my-machine-name",
    "provider": "AZ_01",
    "region": {
      "name": "USA-Central-1",
      "countryCode": "US"
    },
    "volumes": [
      "668f9d8df664e3c028ce5de9"
    ],
    "status": "active",
    "gpuCount": 1,
    "gpuModel": "A100",
    "ram": 100,
    "storage": 100,
    "vramPerGpu": 100,
    "vcpus": 1,
    "interconnect": "VM",
    "nvlink": true,
    "os": "Ubuntu 20.04",
    "ip": "192.168.1.1",
    "sshUser": "sesterce",
    "sshPort": 22,
    "dockerImage": "ubuntu/20.04",
    "dockerCommand": "docker run -it ubuntu/20.04",
    "hourlyPrice": 100,
    "deletedAt": "2021-01-01T00:00:00.000Z",
    "createdAt": "2021-01-01T00:00:00.000Z",
    "updatedAt": "2021-01-01T00:00:00.000Z"
  }
]
Get details about a GPU Cloud instance created
This endpoint is useful when you need to check the details and status of a specific instance, for example, for troubleshooting or configuration verification.
Send your API key secret in the x-api-key header to authenticate the request.
Returns the instance's details
API key invalid
Not found
GET /gpu-cloud/instances/{id} HTTP/1.1
Host: 
x-api-key: text
Accept: */*
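For troubleshooting, the request above could be wrapped in Python and the connection fields turned into an SSH command line. This is a sketch under assumptions: the base URL is a placeholder, the helper names are hypothetical, and this page does not say whether an instance accepts SSH connections while isPending is true.

```python
import json
import urllib.parse
import urllib.request

# Assumption: placeholder host; this page does not state the real one.
BASE_URL = "https://api.example.com"


def build_details_request(api_key: str, instance_id: str) -> urllib.request.Request:
    """Build (but do not send) the GET request for one instance."""
    # Percent-encode the id so it cannot alter the URL path.
    safe_id = urllib.parse.quote(instance_id, safe="")
    return urllib.request.Request(
        f"{BASE_URL}/gpu-cloud/instances/{safe_id}",
        headers={"x-api-key": api_key, "Accept": "application/json"},
        method="GET",
    )


def fetch_instance(api_key: str, instance_id: str) -> dict:
    """Send the request and decode the JSON object describing the instance."""
    with urllib.request.urlopen(build_details_request(api_key, instance_id)) as resp:
        return json.load(resp)


def ssh_command(instance: dict) -> str:
    """Format an ssh command from the sshUser, sshPort, and ip fields."""
    return f"ssh -p {instance['sshPort']} {instance['sshUser']}@{instance['ip']}"
```

With the sample response that follows, ssh_command would produce ssh -p 22 sesterce@127.0.0.1.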
{
  "_id": "668f9d8df664e3c028ce5de9",
  "name": "my-machine-name",
  "provider": "AZ_01",
  "status": "active",
  "region": {
    "name": "USA-Central-1",
    "countryCode": "US",
    "id": "us-central-1"
  },
  "volumes": [
    "668f9d8df664e3c028ce5de9"
  ],
  "sshKey": {
    "_id": "668f9d8df664e3c028ce5de9",
    "name": "my-ssh-key",
    "publicKey": "ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQDA",
    "createdAt": "2021-01-01T00:00:00.000Z",
    "updatedAt": "2021-01-01T00:00:00.000Z"
  },
  "createdAt": "2021-01-01T00:00:00.000Z",
  "deletedAt": "2021-01-01T00:00:00.000Z",
  "isPending": true,
  "hourlyPrice": 0.15,
  "ip": "127.0.0.1",
  "sshUser": "sesterce",
  "sshPort": 22,
  "portForwards": [
    {
      "internalPort": 80,
      "externalPort": 8080
    }
  ],
  "gpuCount": 1,
  "gpuModel": "A100",
  "ram": 100,
  "storage": 100,
  "vramPerGpu": 100,
  "vcpus": 1,
  "interconnect": "pcie",
  "nvlink": true,
  "os": "ubuntu24.04",
  "dockerImage": "ubuntu/20.04",
  "dockerCommand": "docker run -it ubuntu/20.04"
}
Delete a GPU Cloud instance
Use this endpoint when you want to free up resources by deleting an instance that is no longer needed, optimizing your resource usage and costs.
Send your API key secret in the x-api-key header to authenticate the request.
Instance deleted successfully
No content
API key invalid
Not found
DELETE /gpu-cloud/instances/{id} HTTP/1.1
Host: 
x-api-key: text
Accept: */*
No content
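The deletion request could be sketched in Python as follows. The base URL is a placeholder and the helper names are hypothetical. The exact status codes are not listed on this page, so the sketch returns whatever code the server sends instead of checking for a specific one.

```python
import urllib.parse
import urllib.request

# Assumption: placeholder host; this page does not state the real one.
BASE_URL = "https://api.example.com"


def build_delete_request(api_key: str, instance_id: str) -> urllib.request.Request:
    """Build (but do not send) the DELETE request for one instance."""
    # Percent-encode the id so it cannot alter the URL path.
    safe_id = urllib.parse.quote(instance_id, safe="")
    return urllib.request.Request(
        f"{BASE_URL}/gpu-cloud/instances/{safe_id}",
        headers={"x-api-key": api_key, "Accept": "*/*"},
        method="DELETE",
    )


def delete_instance(api_key: str, instance_id: str) -> int:
    """Send the request and return the HTTP status code.

    urllib raises urllib.error.HTTPError for 4xx and 5xx responses,
    so an invalid key or unknown id surfaces as an exception.
    """
    with urllib.request.urlopen(build_delete_request(api_key, instance_id)) as resp:
        return resp.status
```

Because a successful deletion returns no content, the status code is the only signal of success, which is why delete_instance returns it.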