Qwen3-VL-2B export by carinapeng · Pull Request #68 · apple/coreai-models

carinapeng · 2026-06-29T17:00:16Z

CoreAI export for Qwen3-VL-2B

Produces a VLM bundle (main decoder + embedding + vision + tokenizer) conforming to the CoreAISequentialVLMEngine contract. Tightly co-designed with #65 (Add VLM inference infrastructure: engine, protocol, and CLI support).

Add Qwen3-VL vision-language model export support.
- gpu/qwen3_vl.py: Qwen3VLForCausalLM (text decoder, input_ids) and Qwen3VLForCausalLMEmbeddings (inputs_embeds variant for VLM fusion)
- primitives/macos/cache_scatter.py: slice_scatter-based explicit KV cache (avoids stateful mutable_slice_update Metal kernel issues)
- export_qwen3vl_explicit_kv.py: text decoder export (inputs_embeds, explicit KV)
- export_vision_encoder_224.py: vision encoder export (448x448 -> 196 visual tokens)
- registry.py: register qwen3_vl model entry
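To illustrate what an explicit, slice_scatter-style KV cache update means, here is a minimal sketch in plain Python. It is not the code in cache_scatter.py: the `Cache` type, the `slice_scatter` helper, and `decode_step` are hypothetical names, and nested lists stand in for tensors. The point it shows is that the cache is passed in and a new cache is returned, rather than being mutated in place as a stateful buffer.

```python
from typing import List

# Assumption: one layer, one head, shape [max_ctx][head_dim], as nested lists.
Cache = List[List[float]]


def slice_scatter(cache: Cache, src: Cache, start: int) -> Cache:
    """Return a copy of `cache` with rows [start, start + len(src)) replaced by `src`."""
    end = start + len(src)
    if start < 0 or end > len(cache):
        raise ValueError("update does not fit in the cache")
    # Build a fresh list; the input cache is never modified.
    return (
        [row[:] for row in cache[:start]]
        + [row[:] for row in src]
        + [row[:] for row in cache[end:]]
    )


def decode_step(k_cache: Cache, new_k: Cache, pos: int) -> Cache:
    """Hypothetical decode step: the cache is an explicit input and output."""
    return slice_scatter(k_cache, new_k, pos)


if __name__ == "__main__":
    k_cache = [[0.0, 0.0] for _ in range(4)]
    updated = decode_step(k_cache, [[1.0, 2.0]], pos=1)
    print(updated)
    print(k_cache)  # original cache is unchanged
```

Because the update is out-of-place, the exported graph only sees ordinary inputs and outputs, which is presumably why this route sidesteps the stateful kernel path mentioned above.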

stikves · 2026-06-30T01:11:48Z

+
+Usage:
+    cd <repo-root>
+    uv run python python/export_qwen3vl.py [--max-ctx 4096] [--num-layers N]


Shall we use the existing pattern?

uv run coreai.vlm.export qwen3-vl

or similar

And also update:
- Model registry
- A new model-specific subfolder (or models/vlm, or models/qwen)
- README?

We don't have a coreai.vlm.* namespace yet, at least in pyproject, so this is a bigger refactor. Could we handle it in a follow-up, since the scope of this PR is first model support for the VLM infra?

Should be a simple refactor, and will help with discoverability.
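As a rough sketch of the registry-driven entry point discussed in this thread, the following uses `argparse` to dispatch on a model name. Everything here is hypothetical: the `EXPORTERS` table, `export_qwen3_vl`, and the defaults are illustrative, not the repository's actual layout. Only the `--max-ctx` and `--num-layers` flags and the `qwen3-vl` name come from the thread; 4096 as a default is an assumption taken from the usage line.

```python
import argparse
from typing import Callable, Dict, Optional


def export_qwen3_vl(max_ctx: int, num_layers: Optional[int]) -> dict:
    """Hypothetical exporter stub; a real one would build and save the bundle."""
    return {"model": "qwen3-vl", "max_ctx": max_ctx, "num_layers": num_layers}


# Hypothetical registry mapping a model key to its export function.
EXPORTERS: Dict[str, Callable[[int, Optional[int]], dict]] = {
    "qwen3-vl": export_qwen3_vl,
}


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="coreai.vlm.export")
    parser.add_argument("model", choices=sorted(EXPORTERS))
    parser.add_argument("--max-ctx", type=int, default=4096)  # assumed default
    parser.add_argument("--num-layers", type=int, default=None)
    return parser


def main(argv=None) -> dict:
    args = build_parser().parse_args(argv)
    return EXPORTERS[args.model](args.max_ctx, args.num_layers)


if __name__ == "__main__":
    print(main())
```

With this shape, adding another VLM would mean one new entry in the registry table rather than a new standalone script.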

stikves

Overall looks good, but we might want to move the model to a better home: models/qwen3-vl or models/vlm?

carinapeng added 3 commits June 25, 2026 13:14

stateful-KV decoder + self-contained vision encoder

185996b

Linting

8efc99c

carinapeng requested a review from stikves June 29, 2026 17:35

Merge branch 'main' into carina/qwen3vl-export

29dc339

carinapeng marked this pull request as ready for review June 29, 2026 23:14

carinapeng mentioned this pull request Jun 29, 2026

Add VLM inference infrastructure: engine, protocol, and CLI support #65

Merged

stikves reviewed Jun 30, 2026

Comment thread python/src/coreai_models/primitives/macos/cache_scatter.py

stikves reviewed Jun 30, 2026

Comment thread python/src/coreai_models/models/gpu/__init__.py Outdated

stikves reviewed Jun 30, 2026

carinapeng added 5 commits June 29, 2026 20:11

Comment

bc4133f

Merge remote-tracking branch 'upstream/main' into carina/qwen3vl-export

57af0de

Rename, platform specific

c4d88e6

Unit test VLM protocol

cb3cc1b

Merge branch 'carina/qwen3vl-export' of github.com:carinapeng/coreai-…

9295ac3

…models into carina/qwen3vl-export

carinapeng wants to merge 9 commits into apple:main from carinapeng:carina/qwen3vl-export
