Add support for Gemma3 VLM architecture by xenova · Pull Request #1601 · huggingface/transformers.js

xenova · 2026-03-23T22:10:21Z

Example

import {
  AutoProcessor,
  AutoModelForImageTextToText,
  load_image,
  TextStreamer,
} from "@huggingface/transformers";

// Load model and processor
const model_id = "onnx-community/gemma-3-4b-it-ONNX";
const processor = await AutoProcessor.from_pretrained(model_id);
const model = await AutoModelForImageTextToText.from_pretrained(model_id, {
  device: "webgpu",
  dtype: "q4",
});

// Define the list of messages
const messages = [
  {
    role: "user",
    content: [
      { type: "image" },
      { type: "text", text: "Describe this image." },
    ],
  },
];

// Prepare inputs
const prompt = processor.apply_chat_template(messages, { add_generation_prompt: true });
const url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg";
const image = await load_image(url);
const inputs = await processor(prompt, image, { add_special_tokens: false });

const outputs = await model.generate({
  ...inputs,
  max_new_tokens: 2048,
  streamer: new TextStreamer(processor.tokenizer, {
    skip_prompt: true,
    // callback_function: (text) => { /* Do something with the streamed output */ },
  }),
});

// Decode output
const decoded = processor.batch_decode(
  outputs.slice(null, [inputs.input_ids.dims.at(-1), null]),
  { skip_special_tokens: true },
);
console.log(decoded[0]);
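The decode step in the example trims the prompt before decoding. `inputs.input_ids.dims.at(-1)` is the prompt length in tokens, and `outputs.slice(null, [promptLength, null])` keeps every batch row but only the columns from that index onward. Below is a minimal plain-array sketch of the same idea. It is not the library's implementation: `stripPrompt` is a hypothetical helper and the token ids are made up for illustration.

```javascript
// Hypothetical helper (not part of transformers.js): drop the first
// `promptLength` tokens from every sequence in a batch of token ids.
function stripPrompt(sequences, promptLength) {
  return sequences.map((ids) => ids.slice(promptLength));
}

// Made-up token ids: a 3-token prompt followed by 3 generated tokens.
const promptLength = 3;
const sequences = [[2, 105, 106, 4521, 7788, 1]];

const generatedOnly = stripPrompt(sequences, promptLength);
console.log(JSON.stringify(generatedOnly)); // [[4521,7788,1]]
```

Without this trimming, the decoded string would also include the rendered chat template and user prompt, not only the model's reply.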

HuggingFaceDocBuilderDev · 2026-03-23T22:12:18Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

xenova added 9 commits March 19, 2026 00:45

Add gemma3 text-only text-generation unit test

02c7a2a

Add gemma3 modeling tests

3a5f449

add gemma3 pre-processing

f0407e6

Add Gemma3ForConditionalGeneration class

9f9abb2

update registry

47cb6b7

formatting

030c8ba

test that we don't load sessions unnecessarily

0a8c5ad

support hybrid multimodal & decoder-only loading

d314b82

Improve cross-architecture loading

38b3ec5

xenova changed the title from "Add support for Gemma3" to "Add support for Gemma3 VLM architecture" on Mar 23, 2026

xenova merged commit a97b51b into main Mar 23, 2026
4 checks passed

xenova deleted the add-gemma3 branch March 23, 2026 22:37
