Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Our API supports the most popular models for [Chat](/generative-apis/how-to/quer
|-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|
| Qwen | `qwen3.5-397b-a17b` | 250k | 16k | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/Qwen/Qwen3.5-397B-A17B) |
| Mistral | `mistral-small-3.2-24b-instruct-2506` | 128k | 32k | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/mistralai/Mistral-Small-3.2-24B-Instruct-2506) |
| Google (Preview) | `gemma-4-26b-a4b-it` | 256k | 32k | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/google/gemma-4-26B-A4B-it) |
| Google (Preview) | `gemma-3-27b-it` | 40k | 8k | [Gemma](https://ai.google.dev/gemma/terms) | [HF](https://huggingface.co/google/gemma-3-27b-it) |
| H | `holo2-30b-a3b` | 22k | 32k | [CC-BY-NC-4.0](https://spdx.org/licenses/CC-BY-NC-4.0) | [HF](https://huggingface.co/Hcompany/Holo2-30B-A3B) |

Expand Down
37 changes: 37 additions & 0 deletions pages/managed-inference/reference-content/model-catalog.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,8 @@ A quick overview of available models in Scaleway's catalog and their core attrib
| [`whisper-large-v3`](#whisper-large-v3) | OpenAI | - | Audio transcription | L4, L40S, H100, H100-SXM-2 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
| [`qwen3.5-397b-a17b`](#qwen35-397b-a17b) | Qwen | 250k | Text, Code, Vision | H100-SXM-8 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
| [`qwen3-235b-a22b-instruct-2507`](#qwen3-235b-a22b-instruct-2507) | Qwen | 250k | Text | H100-SXM-2 (40k), H100-SXM-4 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
| [`gemma-4-31b-it`](#gemma-4-31b-it) | Google | 250k | Text, Vision | H100 (60k), H100-SXM-2, H100-SXM-4 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
| [`gemma-4-26b-a4b-it`](#gemma-4-26b-a4b-it) | Google | 250k | Text, Vision | H100, H100-SXM-2, H100-SXM-4 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
| [`gemma-3-27b-it`](#gemma-3-27b-it) | Google | 40k | Text, Vision | H100, H100-2 | [Gemma](https://ai.google.dev/gemma/terms) |
| [`llama-3.3-70b-instruct`](#llama-33-70b-instruct) | Meta | 128k | Text | H100 (15k), H100-2 | [Llama 3.3 Community](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) |
| [`llama-3.1-70b-instruct`](#llama-31-70b-instruct) | Meta | 128k | Text | H100 (15k), H100-2 | [Llama 3.1 Community](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct/blob/main/LICENSE) |
Expand Down Expand Up @@ -58,6 +60,8 @@ A quick overview of available models in Scaleway's catalog and their core attrib
| `whisper-large-v3` | - | - | English, French, German, Chinese, Japanese, Korean and 81 additional languages |
| `qwen3.5-397b-a17b` | Yes | Yes | English, French, German, Chinese, Japanese, Korean and 113 additional languages and dialects |
| `qwen3-235b-a22b-instruct-2507` | Yes | Yes | English, French, German, Chinese, Japanese, Korean and 113 additional languages and dialects |
| `gemma-4-31b-it` | Yes | Yes | English, Chinese, Japanese, Korean and 136 additional languages |
| `gemma-4-26b-a4b-it` | Yes | Yes | English, Chinese, Japanese, Korean and 136 additional languages |
| `gemma-3-27b-it` | Yes | Partial | English, Chinese, Japanese, Korean and 31 additional languages |
| `llama-3.3-70b-instruct` | Yes | Yes | English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai |
| `llama-3.1-70b-instruct` | Yes | Yes | English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai |
Expand Down Expand Up @@ -98,6 +102,37 @@ A quick overview of available models in Scaleway's catalog and their core attrib
Vision models can understand and analyze images, not generate them. You will use it through the /v1/chat/completions endpoint.
</Message>

### Gemma-4-31b-it
Gemma-4-31b-it is a frontier small model as of April 2026 to perform agentic and reasoning tasks on many languages.

#### Model names
```
google/gemma-4-31b-it:bf16
```
| Attribute | Value |
|-----------|-------|
| Supports parallel tool calling | Yes |
| Supported reasoning efforts | `none`, `low`, `medium`, `high` |
| Supported image formats | PNG, JPEG, WEBP, and non-animated GIFs |
| Maximum image resolution (pixels) | 896x896 |
| Token dimension (pixels)| 64x64 |

### Gemma-4-26b-a4b-it
Gemma-4-26b-a4b-it is a frontier small model as of April 2026 to perform agentic and reasoning tasks on many languages.
This model has a Mixture-of-Expert (MoE) architecture, providing significant throughput and fitting on a single H100 GPU while supporting its maximum context size.

#### Model names
```
google/gemma-4-26b-a4b-it:bf16
```
| Attribute | Value |
|-----------|-------|
| Supports parallel tool calling | Yes |
| Supported reasoning efforts | `none`, `low`, `medium`, `high` |
| Supported image formats | PNG, JPEG, WEBP, and non-animated GIFs |
| Maximum image resolution (pixels) | 896x896 |
| Token dimension (pixels)| 64x64 |

### Gemma-3-27b-it
Gemma-3-27b-it is a model developed by Google to perform text processing and image analysis on many languages.
The model was not trained specifically to output function / tool call tokens. Hence function calling is currently supported, but reliability remains limited.
Expand Down Expand Up @@ -175,6 +210,7 @@ This model was released as a frontier reasoning model on 16th February 2026.
| Attribute | Value |
|-----------|-------|
| Supports parallel tool calling | Yes |
| Supported reasoning efforts | `none`, `low`, `medium`, `high` |
| Supported images formats | PNG, JPEG, WEBP, and non-animated GIFs |
| Supported video formats | MP4, MPEG, MOV, OGG and WEBM |
| Maximum image resolution (pixels) | 4096x4096 |
Expand Down Expand Up @@ -293,6 +329,7 @@ Currently, this model should be used through Responses API, as Chat Completion d
| Attribute | Value |
|-----------|-------|
| Supports parallel tool calling | Yes |
| Supported reasoning efforts | `low`, `medium`, `high` |

#### Model name
```
Expand Down
Loading