Add JoyAI-Image Edit Plus pipeline and model

### Model/Pipeline/Scheduler description

## Model description  
 __JoyAI-Image__ is a unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing from JD.com. It combines an 8B         
  Multimodal Large Language Model (MLLM) with a 16B Multimodal Diffusion Transformer (MMDiT).                                                                                         
                                                                                                                                                                                      
  __JoyAI-Image Edit Plus__ is the multi-image instruction-guided editing variant. Unlike the single-image JoyAI-Image Edit (added in #XXXX), Edit Plus accepts 1–6 reference images and a
   text instruction to generate a new image that combines elements from the references.
                                                                                                                                                                                      
  Key architectural differences from Edit:                                                                                                                                            
  - __Patchified 6D latent representation:__ Input images and target noise are independently VAE-encoded and patchified into [B, max_patches, C, pt, ph, pw] format with a target_mask to
  distinguish target noise from reference patches.                                                                                                                                    
  - __Variable reference images:__ Supports 1–6 reference images per sample via dynamic shape_list.
  - __Batched RoPE:__ Per-component rotary position embeddings with temporal offsets for each reference image and the target.                                                             
  - __Norm-guided CFG:__ Classifier-free guidance with norm rescaling in a single forward pass.  
  


### Open source status

- [x] The model implementation is available.
- [x] The model weights are available (Only relevant if addition is not a scheduler).

### Provide useful links for the implementation

  - GitHub: https://github.com/jd-opensource/JoyAI-Image                                                                                                                              
  - Weights (Edit): https://huggingface.co/jdopensource/JoyAI-Image-Edit
  - Weights (Edit Plus): https://huggingface.co/jdopensource/JoyAI-Image-Edit-Plus-Diffusers

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add JoyAI-Image Edit Plus pipeline and model #14049

Model/Pipeline/Scheduler description

Model description

Open source status

Provide useful links for the implementation

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Add JoyAI-Image Edit Plus pipeline and model #14049

Description

Model/Pipeline/Scheduler description

Model description

Open source status

Provide useful links for the implementation

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions