# Select your Flavor

### When to choose GPU or CPU flavors?

Depending on your needs, you can choose between two options: CPU and GPU Flavors.

{% tabs %}
{% tab title="When to choose GPU Flavor?" %}
GPU Flavor is ideal for **inference of complex deep learning models**, image processing, or multimedia content generation.
{% endtab %}

{% tab title="When to choose CPU Flavor?" %}
CPU Flavor is better suited to **light tasks**, such as simple text processing algorithms or structured data processing, that do not require very low latency.
{% endtab %}
{% endtabs %}
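
The guidance in the tabs above can be sketched as a small decision helper. This is only an illustration: `Workload` and `choose_flavor` are hypothetical names, not part of any Sesterce API, and treating a low-latency requirement as a reason to pick GPU is an assumption inferred from the CPU description.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical description of an inference workload (not a Sesterce type)."""
    deep_learning_model: bool  # inference of a complex deep learning model
    media_processing: bool     # image processing or multimedia content generation
    needs_low_latency: bool    # very low latency required (assumed to favor GPU)


def choose_flavor(workload: Workload) -> str:
    """Return "GPU" or "CPU" following the guidance on this page."""
    if (
        workload.deep_learning_model
        or workload.media_processing
        or workload.needs_low_latency
    ):
        return "GPU"
    # Light tasks such as simple text or structured data processing
    return "CPU"


if __name__ == "__main__":
    image_generation = Workload(True, True, False)
    csv_cleanup = Workload(False, False, False)
    print(choose_flavor(image_generation))
    print(choose_flavor(csv_cleanup))
```

In practice the choice may also depend on cost and model size, which this sketch does not model.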

### Which GPU Flavors are available for AI Inference?

<table data-card-size="large" data-view="cards" data-full-width="false"><thead><tr><th></th><th></th><th></th><th data-hidden data-card-cover data-type="files"></th></tr></thead><tbody><tr><td><strong>NVIDIA L40S</strong></td><td>GPU specialized for inference tasks, able to accelerate multiple workloads.</td><td></td><td><a href="/files/i2CeSBskEZt7sRGVoWF1">/files/i2CeSBskEZt7sRGVoWF1</a></td></tr><tr><td><strong>NVIDIA H100 TensorCore</strong></td><td>Up to 30 times faster LLM processing. Ideal for complete models with up to 30 billion parameters.</td><td></td><td><a href="/files/GUHRlyhTe5hqsFyMrfrD">/files/GUHRlyhTe5hqsFyMrfrD</a></td></tr><tr><td><strong>NVIDIA A100 TensorCore</strong></td><td>The A100 provides up to 20X higher performance than NVIDIA Volta with zero code changes, and an additional 2X boost with automatic mixed precision and FP16.</td><td></td><td><a href="/files/KnDn69dTaCQlRBEBSUTZ">/files/KnDn69dTaCQlRBEBSUTZ</a></td></tr></tbody></table>

