Add model converter module by fuheaven · Pull Request #1 · fuheaven/LightX2V

fuheaven · 2025-12-22T09:25:12Z

No description provided.

Thanks to HunyuanVideo Team and ModelTC Team.

---------

Co-authored-by: gushiqiao <975033167@qq.com>
Co-authored-by: gushiqiao <77222802+gushiqiao@users.noreply.github.com>
Co-authored-by: chendingyu <chendingyu1@sensetime.com>
Co-authored-by: XHPlus <xhplus@163.com>
Co-authored-by: wangshankun <wangshankun2011@hotmail.com>
Co-authored-by: STwangyingrui <86730325+STwangyingrui@users.noreply.github.com>
Co-authored-by: root <root@pt-80f094c20fc44a8cad096e5f3dbc962e-worker-0.pt-80f094c20fc44a8cad096e5f3dbc962e.ns-devsft-3460edd0.svc.cluster.local>

### Single GPU

```bash
python examples/simple_launch.py
```

```python
# examples/simple_launch.py
from lightx2v import LightGenerator

generator = LightGenerator(
    model_path="/path/to/Wan2.1-T2V-1.3B",
    model_cls="wan2.1",
    task="t2v",
)

video_path = generator.generate(
    prompt="Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage.",
    negative_prompt="镜头晃动，色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走",
    seed=42,
    save_result_path="output.mp4",
)
```

### Multi-GPU

```bash
export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
torchrun --nproc_per_node=8 examples/multi_launch.py
```

---------

Co-authored-by: gushiqiao <975033167@qq.com>
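For readers unfamiliar with `torchrun`: it starts one worker process per GPU and passes each worker its position through the `RANK`, `LOCAL_RANK`, and `WORLD_SIZE` environment variables. The sketch below only shows how a launch script could read those values. The helper names `read_dist_env` and `visible_gpu_count` are hypothetical and are not part of `examples/multi_launch.py` or the LightX2V API.

```python
import os


def read_dist_env(env=None):
    """Read the rank information torchrun exports to each worker process.

    Hypothetical helper for illustration; falls back to single-process
    defaults when the variables are absent (e.g. a plain `python` launch).
    """
    env = os.environ if env is None else env
    return {
        "rank": int(env.get("RANK", 0)),
        "local_rank": int(env.get("LOCAL_RANK", 0)),
        "world_size": int(env.get("WORLD_SIZE", 1)),
    }


def visible_gpu_count(env=None):
    """Count the device ids listed in CUDA_VISIBLE_DEVICES (0 if unset)."""
    env = os.environ if env is None else env
    devices = env.get("CUDA_VISIBLE_DEVICES", "")
    return len([d for d in devices.split(",") if d.strip()])


if __name__ == "__main__":
    info = read_dist_env()
    print(f"rank {info['rank']} of {info['world_size']}, "
          f"{visible_gpu_count()} visible GPU(s)")
```

With the multi-GPU command above, `--nproc_per_node=8` matches the eight ids in `CUDA_VISIBLE_DEVICES`, so each worker would see a `WORLD_SIZE` of 8.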

# kernel-base text encoder

Integrates the optimized operators from sgl_kernel and also enables flash attention.

- flash attention3
- Rmsnorm: use sgl_kernel: from sgl_kernel.elementwise import rmsnorm

# service text encoder

Uses a disaggregated deployment: multiple inference processes can share one encoder service, which can handle concurrent requests.

- Triton autotuning: LIGHTLLM_TRITON_AUTOTUNE_LEVEL=1
- lightllm integrates flash attention3, rmsnorm, and other optimized operators

```
================================================================================
COMPARISON SUMMARY
================================================================================
Encoder              | Time (ms) | Speedup | Cosine Sim | End-to-end accuracy
--------------------------------------------------------------------------------
Baseline (HF)        | 92.17     | 1.00x   | 1.0000     | PASS
Kernel (Flash-2)     | 81.23     | 1.13x   | 0.9900     | PASS
Service (Optimized)  | 71.21     | 1.29x   | 0.9492     | PASS
================================================================================
```

> The table above compares pure inference time. Service mode must also account for network communication overhead (about 5 ms), service router overhead, and so on.
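The Speedup and Cosine Sim columns can be reproduced with a few lines of arithmetic. The sketch below is not the benchmark script from this commit. It assumes that speedup is baseline time divided by candidate time and that cosine similarity is computed over flattened encoder outputs. The function names and the `overhead_ms` parameter are illustrative.

```python
import math


def speedup(baseline_ms, candidate_ms, overhead_ms=0.0):
    """Baseline latency divided by candidate latency plus any extra overhead."""
    return baseline_ms / (candidate_ms + overhead_ms)


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Timings taken from the table above (milliseconds).
print(round(speedup(92.17, 81.23), 2))       # 1.13, kernel mode
print(round(speedup(92.17, 71.21), 2))       # 1.29, service mode
# Service mode with the roughly 5 ms network cost mentioned in the note;
# router overhead is not quantified in the source and is left out here.
print(round(speedup(92.17, 71.21, 5.0), 2))  # 1.21
```

Under these assumptions, adding the network cost alone would lower the service-mode speedup from 1.29x to about 1.21x, still ahead of kernel mode.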

…TC#776) `input_info.return_result_tensor` was ignored for all image generation. Outputting the tensor can be useful for post-processing (such as NSFW checking), without reloading the file from disk. I noticed that for video models, they do not return the tensor directly; they return a map of {"video": tensor} [here](https://github.com/ModelTC/LightX2V/blob/38f9ac0513d0a097df1dd49e95ec4cc73ec426cb/lightx2v/models/runners/default_runner.py#L445). I believe this is for compatibility with ComfyUI. If that's the case, we should only return the tensor and move ComfyUI-specific patterns to the ComfyUI wrapper codebase. What do you think?
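One way to picture the proposal: the runner returns the bare tensor, and any ComfyUI-style wrapping happens in a thin adapter on the caller's side. The sketch below is hypothetical. `wrap_for_comfyui` and `unwrap_result` are illustrative names that exist in neither LightX2V nor ComfyUI, and plain Python objects stand in for tensors.

```python
def wrap_for_comfyui(tensor, key="video"):
    """Adapter-side wrapping: put the bare result under a single dict key.

    Hypothetical helper; the key name follows the {"video": tensor} shape
    described above.
    """
    return {key: tensor}


def unwrap_result(result, key="video"):
    """Return the bare result whether it arrived wrapped in a dict or not."""
    if isinstance(result, dict):
        return result[key]
    return result


# Stand-in for a real tensor so the sketch runs without torch installed.
frames = [[0.0, 0.5], [1.0, 0.25]]

wrapped = wrap_for_comfyui(frames)
assert unwrap_result(wrapped) is frames   # dict form is unwrapped
assert unwrap_result(frames) is frames    # bare form passes through
```

A post-processing step such as an NSFW check could then call `unwrap_result` once and work with the same object regardless of which form the runner returned.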

helloyongyang and others added 30 commits November 10, 2025 17:12

Update docker && configs

adc66e8

Support flashinfer for nbhd attn

db6296f

Update configs (ModelTC#462)

ad56443

[bugfix]: fix conflict (ModelTC#465)

8c1d91e

Co-authored-by: Yang Yong (雍洋) <yongyang1030@163.com>

Add dist warmup code

2a31ba4

update froentend (ModelTC#470)

d8558a0

Co-authored-by: qinxinyi <qxy118045534@163.com>

fix sitemap.xml router (ModelTC#472)

44e215f

Co-authored-by: yihuiwen <yihuiwen@sensetime.com>

[feat]: support matrix game2 universal, gta_drive, templerun & streaming mode

9826b8c

update readme (ModelTC#473)

ffb0da8

update readme (ModelTC#474)

63f0486

Update user token auto refresh (ModelTC#477)

2559b3e

Co-authored-by: qinxinyi <qxy118045534@163.com>

[Fix] remove d2h of cpu-offload infer (ModelTC#476)

51be3ad

update wan22 5090 config (ModelTC#478)

37c5cfc

Co-authored-by: gushiqiao <975033167@qq.com>

update configs

226e71e

Change weight swapping to block swapping (ModelTC#482)

989a30a

Mlu590 deployment (ModelTC#453)

fcc2a41

Feature:
1. Added mlu590 bfloat16 single-GPU and multi-GPU inference.
2. Added mlu590 int8 inference.

fix: progress_callback (ModelTC#483)

3efc43f

update configs (ModelTC#485)

d6c51a0

update configs (ModelTC#486)

bbf7344

Update README with new model information (ModelTC#487)

77ed54f

Added new model links and recommendations for lightweight autoencoders.

Update Readme (ModelTC#488)

b4a1034

Update Readme (ModelTC#490)

4beb6eb

update convert (ModelTC#481)

bbd164c

--linear_dtype and --linear_quant_dtype unify as --linear_type
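To illustrate what unifying two flags into one can look like on the command line, here is a hypothetical `argparse` sketch. It is not the converter's actual code. Whether the real tool still accepts the old flags is not stated in this commit, and the value `int8` used in the usage note is only a placeholder.

```python
import argparse
import warnings

# Old flag names from the commit message, kept here as deprecated aliases.
OLD_FLAGS = ("linear_dtype", "linear_quant_dtype")


def build_parser():
    """Build a parser with the unified flag plus hidden legacy aliases."""
    parser = argparse.ArgumentParser(description="hypothetical converter CLI")
    parser.add_argument("--linear_type", default=None,
                        help="unified setting for linear layers")
    for old in OLD_FLAGS:
        # Hidden from --help; accepted only to ease migration in this sketch.
        parser.add_argument(f"--{old}", default=None, help=argparse.SUPPRESS)
    return parser


def resolve_linear_type(args):
    """Prefer --linear_type; fall back to a legacy flag with a warning."""
    if args.linear_type is not None:
        return args.linear_type
    for old in OLD_FLAGS:
        value = getattr(args, old)
        if value is not None:
            warnings.warn(f"--{old} is deprecated; use --linear_type",
                          DeprecationWarning)
            return value
    return None
```

For example, `resolve_linear_type(build_parser().parse_args(["--linear_type", "int8"]))` returns `"int8"`, and passing `--linear_dtype int8` instead yields the same value along with a deprecation warning.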

Revise README_zh.md for updates and enhancements (ModelTC#489)

424150f

Updated README_zh.md with new features and model support.

Update link for HunyuanVideo-1.5 in README (ModelTC#491)

abdbe71

Update Readme (ModelTC#492)

d996a81

update readme (ModelTC#495)

75a5662

Delete lightx2v.egg-info directory (ModelTC#496)

9a116d5

helloyongyang and others added 22 commits January 19, 2026 20:28

update input_info (ModelTC#774)

88829a6

update readme (ModelTC#795)

8e65a32

update readme (ModelTC#796)

59df998

update readme (ModelTC#797)

99192e1

ring_attn_fp8_kv_fusion (ModelTC#787)

dc0045b

ring_attn: fp8_comm & kv_fusion

---------

Co-authored-by: root <root@pt-72be2ccd01a14fa18a4b18c6c347f823-worker-0.pt-72be2ccd01a14fa18a4b18c6c347f823.ns-devsft-3460edd0.svc.cluster.local>

ring_attn.py (ModelTC#798)

2ed2f27

use_kv_fusion → use_tensor_fusion

---------

Co-authored-by: root <root@pt-72be2ccd01a14fa18a4b18c6c347f823-worker-0.pt-72be2ccd01a14fa18a4b18c6c347f823.ns-devsft-3460edd0.svc.cluster.local>

[Feat] LTX-2 support (t2v/i2v, offload, cfg parallel, fp8-per_tensor) (ModelTC#791)

7d6ef50

Co-authored-by: gushiqiao <975033167>

update readme (ModelTC#799)

5238224

update examples (ModelTC#801)

9a0b30b

Co-authored-by: gushiqiao <975033167>

update readme (ModelTC#803)

8b45406

[BugFix]: f2v seq parallel padding error (ModelTC#800)

e8f9202

Remove dtype conversion for actual bias in mm_weight.py (ModelTC#804)

9a0c270

Add omni system prompt (ModelTC#805)

060a988

fix apply multiple loras (ModelTC#806)

1c10948

Co-authored-by: gushiqiao <975033167>

fix qwen sp

5c54b9c

updata wan2.1-1.3B.md (ModelTC#809)

edabf36

[Feat] Support LTX2 sequence parallel, tensor parallel, and VAE decode tiling (ModelTC#810)

746d44b

---------

Co-authored-by: gushiqiao <975033167>

add reject status (ModelTC#811)

a426327

Co-authored-by: yihuiwen <yihuiwen@sensetime.com>

Benchmark 2601 (ModelTC#812)

bef76dc

add int8 mm for dcu platform

4fdc249

fuheaven force-pushed the dcu branch from 8dc50ad to 4fdc249 on January 26, 2026 04:45

fuheaven added 3 commits January 26, 2026 14:25

merge main

597ac4f

adapt lora input

b40637a

format code

b94b58e

fuheaven force-pushed the dcu branch from 8b8c82a to b94b58e on January 26, 2026 07:59

add new script for hygon dcu

8ecdd5b

fuheaven force-pushed the dcu branch from 0859cca to 8ecdd5b on January 26, 2026 09:19

fuheaven force-pushed the main branch from f42f42d to b74f415 on March 13, 2026 07:28

fuheaven wants to merge 381 commits into main from dcu

20 participants
