flux-2-klein-9B lora training error

when i train flux-2-klein-9B lora, this message is shown and program is blocked. error shows Qwen3_4B, i think flux-2-klein needs Qwen3_8B. i have no idea. help!

[rank0]: Traceback (most recent call last):
[rank0]:   File "/mnt/i/AI/diffusion-pipe/train.py", line 495, in <module>
[rank0]:     dataset_manager.cache()
[rank0]:   File "/mnt/i/AI/diffusion-pipe/utils/dataset.py", line 1148, in cache
[rank0]:     self._handle_task(task)
[rank0]:   File "/mnt/i/AI/diffusion-pipe/venv/lib/python3.12/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
[rank0]:     return func(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/utils/dataset.py", line 1187, in _handle_task
[rank0]:     self.submodels[id].load_model_if_needed()
[rank0]:   File "/mnt/i/AI/diffusion-pipe/models/base.py", line 362, in load_model_if_needed
[rank0]:     self._model = self._load_fn()
[rank0]:                   ^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/models/base.py", line 414, in load_fn
[rank0]:     return comfy.sd.load_clip(ckpt_paths=paths, clip_type=clip_type)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd.py", line 970, in load_clip
[rank0]:     return load_text_encoder_state_dicts(clip_data, embedding_directory=embedding_directory, clip_type=clip_type, model_options=model_options)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd.py", line 1221, in load_text_encoder_state_dicts
[rank0]:     m, u = clip.load_sd(c)
[rank0]:            ^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd.py", line 264, in load_sd
[rank0]:     return self.cond_stage_model.load_sd(sd)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd1_clip.py", line 711, in load_sd
[rank0]:     return getattr(self, self.clip).load_sd(sd)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd1_clip.py", line 305, in load_sd
[rank0]:     return self.transformer.load_state_dict(sd, strict=False)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/mnt/i/AI/diffusion-pipe/venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 2593, in load_state_dict
[rank0]:     raise RuntimeError(
[rank0]: RuntimeError: Error(s) in loading state_dict for Qwen3_4B:
[rank0]:        size mismatch for model.embed_tokens.weight: copying a param with shape torch.Size([151936, 4096]) from checkpoint, the shape in current model is torch.Size([151936, 2560]).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

flux-2-klein-9B lora training error #497

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

flux-2-klein-9B lora training error #497

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions