
feat: support Qwen2D VAE #1714

Open
stduhpf wants to merge 2 commits into leejet:master from stduhpf:qwen2d-vae


@stduhpf stduhpf commented Jun 26, 2026

Contributor

Summary

This is the Wan2.1 VAE with its 3D convolutions pruned into 2D convolutions, intended for decoding single frames.

Download Link: https://huggingface.co/Anzhc/Qwen2D-VAE

(The tensor naming convention in this file differs from the one currently supported in stable-diffusion.cpp; that is fixed by #1713.)
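
To illustrate why pruning 3D convolutions into 2D ones can be lossless for single frames, here is a minimal pure-Python sketch. It assumes causal temporal padding (zero frames inserted before the first real frame), which is my understanding of how Wan-style VAEs handle the time axis. Under that assumption, a one-frame input only ever meets the last temporal slice of each 3D kernel, so keeping that slice as a 2D kernel gives the same output. The helper names (`conv2d`, `causal_conv3d_single_frame`, `prune_to_2d`) are hypothetical and single-channel for brevity; this is not necessarily how the Qwen2D-VAE weights were produced, and it is not the stable-diffusion.cpp implementation.

```python
import random


def conv2d(frame, kernel2d):
    """Valid (unpadded) 2D cross-correlation of one H x W frame."""
    kh, kw = len(kernel2d), len(kernel2d[0])
    out_h = len(frame) - kh + 1
    out_w = len(frame[0]) - kw + 1
    return [
        [
            sum(
                frame[y + i][x + j] * kernel2d[i][j]
                for i in range(kh)
                for j in range(kw)
            )
            for x in range(out_w)
        ]
        for y in range(out_h)
    ]


def causal_conv3d_single_frame(frame, kernel3d):
    """Causal 3D convolution of a one-frame clip.

    Assumption: the clip is padded with (kt - 1) zero frames *before*
    the real frame, so exactly one output frame is produced.
    """
    kt = len(kernel3d)
    h, w = len(frame), len(frame[0])
    zero_frame = [[0.0] * w for _ in range(h)]
    clip = [zero_frame] * (kt - 1) + [frame]
    # Each padded frame is convolved with its own temporal kernel slice,
    # then the per-slice results are summed into the single output frame.
    partial = [conv2d(clip[t], kernel3d[t]) for t in range(kt)]
    out_h, out_w = len(partial[0]), len(partial[0][0])
    return [
        [sum(p[y][x] for p in partial) for x in range(out_w)]
        for y in range(out_h)
    ]


def prune_to_2d(kernel3d):
    """Keep only the last temporal slice, the only one that sees real data."""
    return kernel3d[-1]


random.seed(0)
# Hypothetical sizes: a 3 x 3 x 3 kernel and a single 5 x 5 frame.
kernel3d = [
    [[random.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
    for _ in range(3)
]
frame = [[random.uniform(-1, 1) for _ in range(5)] for _ in range(5)]

out_3d = causal_conv3d_single_frame(frame, kernel3d)
out_2d = conv2d(frame, prune_to_2d(kernel3d))
max_diff = max(
    abs(a - b) for row3, row2 in zip(out_3d, out_2d) for a, b in zip(row3, row2)
)
print("max abs difference between 3D and pruned 2D output:", max_diff)
```

The zero-padded frames contribute nothing to the sum, so the other temporal slices are dead weight for single-image use and can be dropped.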

Related Issue / Discussion

#1610

Additional Information

Decode:

Qwen2D-VAE:

[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 4563.99 MB(VRAM)
[DEBUG] model_manager.cpp:727  - model manager releasing compute params ( 48.28 MB, 104 tensors) from ROCm0
[DEBUG] vae.hpp:215  - computing vae decode graph completed, taking 2.02s

qwen_image_vae:

[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 7420.44 MB(VRAM)
[DEBUG] model_manager.cpp:727  - model manager releasing compute params (136.46 MB, 104 tensors) from ROCm0
[DEBUG] vae.hpp:215  - computing vae decode graph completed, taking 5.84s

Encode:

Qwen2D-VAE:

[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 2863.76 MB(VRAM)
[DEBUG] model_manager.cpp:727  - model manager releasing compute params ( 36.85 MB, 82 tensors) from ROCm0
[DEBUG] vae.hpp:161  - computing vae encode graph completed, taking 1.18s

qwen_image_vae:

[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 7627.54 MB(VRAM)
[DEBUG] model_manager.cpp:727  - model manager releasing compute params (101.20 MB, 82 tensors) from ROCm0
[DEBUG] vae.hpp:161  - computing vae encode graph completed, taking 2.94s
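
For a quick read of the numbers above, the following small script parses the compute buffer size and graph time out of the log lines and reports the relative difference. The log lines are copied from this description; the script itself (`parse`, `compare`) is only an illustrative helper, and each figure comes from a single run, so treat the ratios as rough. They work out to about a 38% smaller compute buffer and about 2.9x faster decode, and about a 62% smaller compute buffer and about 2.5x faster encode.

```python
import re

# Log lines copied from the measurements above (single runs on ROCm0).
LOGS = {
    ("decode", "Qwen2D-VAE"): [
        "[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 4563.99 MB(VRAM)",
        "[DEBUG] vae.hpp:215  - computing vae decode graph completed, taking 2.02s",
    ],
    ("decode", "qwen_image_vae"): [
        "[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 7420.44 MB(VRAM)",
        "[DEBUG] vae.hpp:215  - computing vae decode graph completed, taking 5.84s",
    ],
    ("encode", "Qwen2D-VAE"): [
        "[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 2863.76 MB(VRAM)",
        "[DEBUG] vae.hpp:161  - computing vae encode graph completed, taking 1.18s",
    ],
    ("encode", "qwen_image_vae"): [
        "[DEBUG] ggml_extend.hpp:2007 - wan_vae compute buffer size: 7627.54 MB(VRAM)",
        "[DEBUG] vae.hpp:161  - computing vae encode graph completed, taking 2.94s",
    ],
}

BUFFER_RE = re.compile(r"compute buffer size: ([\d.]+) MB")
TIME_RE = re.compile(r"taking ([\d.]+)s")


def parse(lines):
    """Return (compute buffer in MB, graph time in seconds) for one run."""
    text = "\n".join(lines)
    buffer_mb = float(BUFFER_RE.search(text).group(1))
    seconds = float(TIME_RE.search(text).group(1))
    return buffer_mb, seconds


def compare(stage):
    """Compare Qwen2D-VAE against qwen_image_vae for 'decode' or 'encode'."""
    new_mb, new_s = parse(LOGS[(stage, "Qwen2D-VAE")])
    old_mb, old_s = parse(LOGS[(stage, "qwen_image_vae")])
    return {
        "buffer_saving": 1 - new_mb / old_mb,  # fraction of VRAM buffer saved
        "speedup": old_s / new_s,              # how many times faster
    }


for stage in ("decode", "encode"):
    result = compare(stage)
    print(
        f"{stage}: {result['buffer_saving']:.1%} smaller compute buffer, "
        f"{result['speedup']:.2f}x faster"
    )
```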

Checklist

@daniandtheweb

Contributor

That's amazing. With this VAE, tiling is no longer required on Vulkan when using Qwen Image or Anima models.
