scaleway · fpagny · Apr 24, 2026 · Apr 24, 2026 · Apr 24, 2026 · Apr 27, 2026
@@ -17,6 +17,7 @@ Our API supports the most popular models for [Chat](/generative-apis/how-to/quer
 |-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|
 | Qwen      | `qwen3.5-397b-a17b`     | 250k | 16k | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/Qwen/Qwen3.5-397B-A17B) |
 | Mistral | `mistral-small-3.2-24b-instruct-2506`  | 128k  | 32k | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/mistralai/Mistral-Small-3.2-24B-Instruct-2506) |
+| Google (Preview)   | `gemma-4-26b-a4b-it`  | 256k  | 32k | [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0) | [HF](https://huggingface.co/google/gemma-4-26B-A4B-it) |
 | Google (Preview)   | `gemma-3-27b-it`  | 40k  | 8k | [Gemma](https://ai.google.dev/gemma/terms) | [HF](https://huggingface.co/google/gemma-3-27b-it) |
 | H | `holo2-30b-a3b`  | 22k  | 32k | [CC-BY-NC-4.0](https://spdx.org/licenses/CC-BY-NC-4.0) | [HF](https://huggingface.co/Hcompany/Holo2-30B-A3B) |
 

@@ -20,6 +20,8 @@ A quick overview of available models in Scaleway's catalog and their core attrib
 | [`whisper-large-v3`](#whisper-large-v3) | OpenAI | - | Audio transcription | L4, L40S, H100, H100-SXM-2 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
 | [`qwen3.5-397b-a17b`](#qwen35-397b-a17b) | Qwen | 250k | Text, Code, Vision | H100-SXM-8 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
 | [`qwen3-235b-a22b-instruct-2507`](#qwen3-235b-a22b-instruct-2507) | Qwen | 250k | Text | H100-SXM-2 (40k), H100-SXM-4 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
+| [`gemma-4-31b-it`](#gemma-4-31b-it) | Google | 250k | Text, Vision | H100 (60k), H100-SXM-2, H100-SXM-4 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
+| [`gemma-4-26b-a4b-it`](#gemma-4-26b-a4b-it) | Google | 250k | Text, Vision | H100, H100-SXM-2, H100-SXM-4 | [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
 | [`gemma-3-27b-it`](#gemma-3-27b-it) | Google | 40k | Text, Vision | H100, H100-2 | [Gemma](https://ai.google.dev/gemma/terms) |
 | [`llama-3.3-70b-instruct`](#llama-33-70b-instruct) | Meta | 128k | Text | H100 (15k), H100-2 | [Llama 3.3 Community](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) |
 | [`llama-3.1-70b-instruct`](#llama-31-70b-instruct) | Meta | 128k | Text | H100 (15k), H100-2 | [Llama 3.1 Community](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct/blob/main/LICENSE) |
@@ -58,6 +60,8 @@ A quick overview of available models in Scaleway's catalog and their core attrib
 | `whisper-large-v3` | - | - | English, French, German, Chinese, Japanese, Korean and 81 additional languages  |
 | `qwen3.5-397b-a17b` | Yes | Yes | English, French, German, Chinese, Japanese, Korean and 113 additional languages and dialects |
 | `qwen3-235b-a22b-instruct-2507` | Yes | Yes | English, French, German, Chinese, Japanese, Korean and 113 additional languages and dialects |
+| `gemma-4-31b-it` | Yes | Yes | English, Chinese, Japanese, Korean and 136 additional languages |
+| `gemma-4-26b-a4b-it` | Yes | Yes | English, Chinese, Japanese, Korean and 136 additional languages |
 | `gemma-3-27b-it` | Yes | Partial | English, Chinese, Japanese, Korean and 31 additional languages |
 | `llama-3.3-70b-instruct` | Yes | Yes | English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai |
 | `llama-3.1-70b-instruct` | Yes | Yes | English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai |
@@ -98,6 +102,37 @@ A quick overview of available models in Scaleway's catalog and their core attrib
   Vision models can understand and analyze images, not generate them. You will use it through the /v1/chat/completions endpoint.
 </Message>
 
+### Gemma-4-31b-it
+Gemma-4-31b-it is a frontier small model as of April 2026 to perform agentic and reasoning tasks on many languages.
+
+#### Model names
+```
+google/gemma-4-31b-it:bf16
+```
+| Attribute | Value |
+|-----------|-------|
+| Supports parallel tool calling | Yes |
+| Supported reasoning efforts | `none`, `low`, `medium`, `high` |
+| Supported image formats | PNG, JPEG, WEBP, and non-animated GIFs |
+| Maximum image resolution (pixels) | 896x896 |
+| Token dimension (pixels)| 64x64 |
+
+### Gemma-4-26b-a4b-it
+Gemma-4-26b-a4b-it is a frontier small model as of April 2026 to perform agentic and reasoning tasks on many languages.
+This model has a Mixture-of-Expert (MoE) architecture, providing significant throughput and fitting on a single H100 GPU while supporting its maximum context size.
+
+#### Model names
+```
+google/gemma-4-26b-a4b-it:bf16
+```
+| Attribute | Value |
+|-----------|-------|
+| Supports parallel tool calling | Yes |
+| Supported reasoning efforts | `none`, `low`, `medium`, `high` |
+| Supported image formats | PNG, JPEG, WEBP, and non-animated GIFs |
+| Maximum image resolution (pixels) | 896x896 |
+| Token dimension (pixels)| 64x64 |
+
 ### Gemma-3-27b-it
 Gemma-3-27b-it is a model developed by Google to perform text processing and image analysis on many languages.
 The model was not trained specifically to output function / tool call tokens. Hence function calling is currently supported, but reliability remains limited.
@@ -175,6 +210,7 @@ This model was released as a frontier reasoning model on 16th February 2026.
 | Attribute | Value |
 |-----------|-------|
 | Supports parallel tool calling | Yes |
+| Supported reasoning efforts | `none`, `low`, `medium`, `high` |
 | Supported images formats | PNG, JPEG, WEBP, and non-animated GIFs |
 | Supported video formats | MP4, MPEG, MOV, OGG and WEBM |
 | Maximum image resolution (pixels) | 4096x4096 |
@@ -293,6 +329,7 @@ Currently, this model should be used through Responses API, as Chat Completion d
 | Attribute | Value |
 |-----------|-------|
 | Supports parallel tool calling | Yes |
+| Supported reasoning efforts | `low`, `medium`, `high` |
 
 #### Model name
 ```