Skip to content

flux-2-klein-9B lora training error #497

@wuyuetiger

Description

@wuyuetiger

when i train flux-2-klein-9B lora, this message is shown and program is blocked. error shows Qwen3_4B, i think flux-2-klein needs Qwen3_8B. i have no idea. help!

[rank0]: Traceback (most recent call last):
[rank0]: File "/mnt/i/AI/diffusion-pipe/train.py", line 495, in
[rank0]: dataset_manager.cache()
[rank0]: File "/mnt/i/AI/diffusion-pipe/utils/dataset.py", line 1148, in cache
[rank0]: self._handle_task(task)
[rank0]: File "/mnt/i/AI/diffusion-pipe/venv/lib/python3.12/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
[rank0]: return func(*args, **kwargs)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/utils/dataset.py", line 1187, in _handle_task
[rank0]: self.submodels[id].load_model_if_needed()
[rank0]: File "/mnt/i/AI/diffusion-pipe/models/base.py", line 362, in load_model_if_needed
[rank0]: self._model = self._load_fn()
[rank0]: ^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/models/base.py", line 414, in load_fn
[rank0]: return comfy.sd.load_clip(ckpt_paths=paths, clip_type=clip_type)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd.py", line 970, in load_clip
[rank0]: return load_text_encoder_state_dicts(clip_data, embedding_directory=embedding_directory, clip_type=clip_type, model_options=model_options)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd.py", line 1221, in load_text_encoder_state_dicts
[rank0]: m, u = clip.load_sd(c)
[rank0]: ^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd.py", line 264, in load_sd
[rank0]: return self.cond_stage_model.load_sd(sd)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd1_clip.py", line 711, in load_sd
[rank0]: return getattr(self, self.clip).load_sd(sd)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/models/../submodules/ComfyUI/comfy/sd1_clip.py", line 305, in load_sd
[rank0]: return self.transformer.load_state_dict(sd, strict=False)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/mnt/i/AI/diffusion-pipe/venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 2593, in load_state_dict
[rank0]: raise RuntimeError(
[rank0]: RuntimeError: Error(s) in loading state_dict for Qwen3_4B:
[rank0]: size mismatch for model.embed_tokens.weight: copying a param with shape torch.Size([151936, 4096]) from checkpoint, the shape in current model is torch.Size([151936, 2560]).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions