Feat; Add support for Wan/Qwen TAEHV decoding #937

stduhpf · 2025-11-03T15:25:59Z

Model weights: https://github.com/madebyollin/taehv/blob/main/taew2_1.pth

Only tested "successfuly" for decoding Qwen-Image outputs, still need some work to support video models. Encoding seems to work too, at least in img2img mode.

.\bin\Release\sd.exe --diffusion-model ..\..\ComfyUI\models\diffusion_models\qwen-image-Q8_0.gguf --vae ..\..\ComfyUI\models\vae\qwen_image_vae.safetensors --qwen2vl ..\..\ComfyUI\models\text_encoders\Qwen2.5-VL-7B-Instruct-Q8_0.gguf -p '一个穿着"QWEN"标志的T恤的中国美女正拿着黑色的马克笔面相镜头微笑。她身后的玻璃板上手写体写着 “一、Qwen-Image的技术路线：探索视觉生成基础模型的极限，开创理解与生成一体化的未来。二、Qwen-Image的模型特色：1、复杂文字渲染。支持中英渲染、自动布局； 2、精准图像编辑。支持文字编辑、物体增减、风格变换。三、Qwen-Image的未来愿景：赋能专业内容创作、助力生成式AI发展。”' --cfg-scale 2.5 --sampling-method euler -v --offload-to-cpu -H 1024 -W 1024 --diffusion-fa --flow-shift 3 --tae ..\ComfyUI\models\vae_approx\taew2_1.pth --vae-conv-direct

Speedup and memory saving aren't that impressive yet, maybe it can be improved further?

stduhpf · 2025-11-03T15:28:49Z

Sorry for the unrelated whitespace changes and the debug spam, will fix later

stduhpf · 2025-11-03T21:04:53Z

Oh a new version of the taew2.1 weights just came out, coincidentally.

Old Weights	New Weights

stduhpf · 2025-11-03T23:17:52Z

Now tae decoding for the outputs of Wan2.1 models (and Wan2.2 A14B) works in txt2img mode.

Video decoding is running as well, but the results are obviously incorrect (flashing lights warning)

If someone can see what I'm doing wrong when decoding videos, let me know.

madebyollin · 2025-12-11T00:03:14Z

After fixing the three bugs mentioned in review, image results look correct (tested on GH200 with -DSD_CUDA=ON). I didn't check video.

diffs

diff --git a/tae.hpp b/tae.hpp
index ad0bd37..6a7951f 100644
--- a/tae.hpp
+++ b/tae.hpp
@@ -224,7 +224,7 @@ public:
         h      = conv1->forward(ctx, h);
         h      = ggml_relu_inplace(ctx->ggml_ctx, h);
         h      = conv2->forward(ctx, h);
-        h      = ggml_relu_inplace(ctx->ggml_ctx, h);
+        // h      = ggml_relu_inplace(ctx->ggml_ctx, h);
 
         auto skip = x;
         if (has_skip_conv) {
@@ -323,7 +323,7 @@ public:
         for (int i = 0; i < num_layers; i++) {
             for (int j = 0; j < num_blocks; j++) {
                 auto block = std::dynamic_pointer_cast<MemBlock>(blocks[std::to_string(index++)]);
-                auto mem   = ggml_pad(ctx->ggml_ctx, h, 0, 0, 0, 1);
+                auto mem   = ggml_pad_ext(ctx->ggml_ctx, h, 0, 0, 0, 0, 0, 0, 1, 0);
                 mem        = ggml_view_4d(ctx->ggml_ctx, mem, h->ne[0], h->ne[1], h->ne[2], h->ne[3], h->nb[1], h->nb[2], h->nb[3], 0);
                 h          = block->forward(ctx, h, mem);
             }
@@ -341,7 +341,7 @@ public:
         h              = last_conv->forward(ctx, h);
 
         // shape(W, H, 3, T+3) => shape(W, H, 3, T)
-        h = ggml_view_4d(ctx->ggml_ctx, h, h->ne[0], h->ne[1], h->ne[2], h->ne[3] - 3, h->nb[1], h->nb[2], h->nb[3], 0);
+        h = ggml_view_4d(ctx->ggml_ctx, h, h->ne[0], h->ne[1], h->ne[2], h->ne[3] - 3, h->nb[1], h->nb[2], h->nb[3], 3*h->nb[3]);
         return h;

tae.hpp

Co-authored-by: Ollin Boer Bohan <[email protected]>

stduhpf · 2025-12-11T19:13:25Z

Video is still completely broken, but image decoding works very well now.

stduhpf mentioned this pull request Nov 6, 2025

[Bug] TAESD with WAN-2.1 and 2.2 dump core #946

Open

CarlGao4 mentioned this pull request Dec 9, 2025

[Feature] TAEHV Support with WAN weights [TAEW2_2] #1069

Open

madebyollin suggested changes Dec 11, 2025

View reviewed changes

tae.hpp Outdated Show resolved Hide resolved

tae.hpp Outdated Show resolved Hide resolved

tae.hpp Outdated Show resolved Hide resolved

stduhpf added 7 commits December 11, 2025 19:20

Add support for Wan2.1 TAEHV decoding

b671126

--tae instead of --taesd

776385d

progress towards video support

bd0c5e2

Wan2.1 decode not crashing anymore (still broken)

56339d4

Less broken video decode + remove log spam

9d0005a

Taehv fixes

dd75498

Co-authored-by: Ollin Boer Bohan <[email protected]>

Adapt to lastest changes

fde734b

stduhpf force-pushed the taehv branch from d04fd90 to fde734b Compare December 11, 2025 18:30

taew2.1 encode support

baf122d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat; Add support for Wan/Qwen TAEHV decoding #937

Feat; Add support for Wan/Qwen TAEHV decoding #937

stduhpf commented Nov 3, 2025 •

edited

Loading

Uh oh!

stduhpf commented Nov 3, 2025

Uh oh!

stduhpf commented Nov 3, 2025 •

edited

Loading

Uh oh!

stduhpf commented Nov 3, 2025 •

edited

Loading

Uh oh!

madebyollin commented Dec 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stduhpf commented Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feat; Add support for Wan/Qwen TAEHV decoding #937

Are you sure you want to change the base?

Feat; Add support for Wan/Qwen TAEHV decoding #937

Conversation

stduhpf commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 3, 2025

Uh oh!

stduhpf commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

madebyollin commented Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stduhpf commented Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stduhpf commented Nov 3, 2025 •

edited

Loading

stduhpf commented Nov 3, 2025 •

edited

Loading

stduhpf commented Nov 3, 2025 •

edited

Loading

madebyollin commented Dec 11, 2025 •

edited

Loading