huggingface · tangyanf · Jun 18, 2026 · Jun 22, 2026 · Jun 23, 2026 · Jun 23, 2026
diff --git a/docs/source/en/_toctree.yml b/docs/source/en/_toctree.yml
@@ -355,6 +355,8 @@
         title: Ideogram4Transformer2DModel
       - local: api/models/transformer_joyimage
         title: JoyImageEditTransformer3DModel
+      - local: api/models/transformer_joyimage_edit_plus
+        title: JoyImageEditPlusTransformer3DModel
       - local: api/models/krea2_transformer2d
         title: Krea2Transformer2DModel
       - local: api/models/latte_transformer3d
@@ -555,6 +557,8 @@
         title: InstructPix2Pix
       - local: api/pipelines/joyimage_edit
         title: JoyImage Edit
+      - local: api/pipelines/joyimage_edit_plus
+        title: JoyImage Edit Plus
       - local: api/pipelines/kandinsky
         title: Kandinsky 2.1
       - local: api/pipelines/kandinsky_v22

diff --git a/docs/source/en/api/models/transformer_joyimage_edit_plus.md b/docs/source/en/api/models/transformer_joyimage_edit_plus.md
@@ -0,0 +1,29 @@
+<!--Copyright 2025 The HuggingFace Team. All rights reserved.
+
+Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
+an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
+specific language governing permissions and limitations under the License.
+-->
+
+# JoyImageEditPlusTransformer3DModel
+
+The model can be loaded with the following code snippet.
+
+```python
+import torch
+from diffusers import JoyImageEditPlusTransformer3DModel
+transformer = JoyImageEditPlusTransformer3DModel.from_pretrained("jdopensource/JoyAI-Image-Edit-Plus-Diffusers", subfolder="transformer", torch_dtype=torch.bfloat16)
+```
+
+## JoyImageEditPlusTransformer3DModel
+
+[[autodoc]] JoyImageEditPlusTransformer3DModel
+
+## Transformer2DModelOutput
+
+[[autodoc]] models.modeling_outputs.Transformer2DModelOutput
diff --git a/docs/source/en/api/pipelines/joyimage_edit_plus.md b/docs/source/en/api/pipelines/joyimage_edit_plus.md
@@ -0,0 +1,61 @@
+<!--Copyright 2025 The HuggingFace Team. All rights reserved.
+
+Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
+an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
+specific language governing permissions and limitations under the License.
+-->
+
+# JoyAI-Image-Edit-Plus
+
+[JoyAI-Image](https://github.com/jd-opensource/JoyAI-Image) is a unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing. It combines an 8B Multimodal Large Language Model (MLLM) with a 16B Multimodal Diffusion Transformer (MMDiT).
+
+JoyAI-Image-Edit-Plus is a multi-image, instruction-guided editing model. It takes **one to five reference images** per sample plus a text instruction, and generates a new image that combines elements from the references as the instruction directs.
+
+| Model | Description | Download |
+|:-----:|:-----------:|:--------:|
+| JoyAI-Image-Edit-Plus | Multi-image instruction-guided editing with element composition from multiple references | [Hugging Face](https://huggingface.co/jdopensource/JoyAI-Image-Edit-Plus-Diffusers) |
+
+```python
+import torch
+from PIL import Image
+from diffusers import JoyImageEditPlusPipeline
+
+pipeline = JoyImageEditPlusPipeline.from_pretrained(
+    "jdopensource/JoyAI-Image-Edit-Plus-Diffusers", torch_dtype=torch.bfloat16
+)
+pipeline.to("cuda")
+
+images = [
+    Image.open("reference_0.png").convert("RGB"),
+    Image.open("reference_1.png").convert("RGB"),
+]
+
+target_h, target_w = pipeline._get_bucket_size(images[-1])
+
+output = pipeline(
+    images=images,
+    prompt="Combine the person from the second image with the scene from the first image.",
+    negative_prompt="low quality, blurry, deformed",
+    height=target_h,
+    width=target_w,
+    num_inference_steps=30,
+    guidance_scale=4.0,
+    generator=torch.Generator("cuda").manual_seed(42),
+).images[0]
+output.save("joyimage_edit_plus_output.png")
+```
+
+## JoyImageEditPlusPipeline
+
+[[autodoc]] JoyImageEditPlusPipeline
+  - all
+  - __call__
+
+## JoyImageEditPlusPipelineOutput
+
+[[autodoc]] pipelines.joyimage.pipeline_output.JoyImageEditPlusPipelineOutput
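
The pipeline page above says the model supports 1–5 reference images per sample. A caller may want to check that count before starting a long generation. The following is a minimal sketch under that assumption: `check_reference_images`, `MIN_REFERENCES`, and `MAX_REFERENCES` are hypothetical names introduced here for illustration and are not part of diffusers. Whether `JoyImageEditPlusPipeline` performs its own validation is not stated in this diff.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")

# Assumption: bounds taken from the "1–5 reference images per sample" statement in the docs.
MIN_REFERENCES = 1
MAX_REFERENCES = 5


def check_reference_images(images: Sequence[T]) -> List[T]:
    """Hypothetical helper (not a diffusers API): validate the number of references.

    Returns the references as a list, or raises ValueError if the count
    falls outside the documented 1-5 range.
    """
    refs = list(images)
    if not MIN_REFERENCES <= len(refs) <= MAX_REFERENCES:
        raise ValueError(
            f"Expected between {MIN_REFERENCES} and {MAX_REFERENCES} "
            f"reference images, got {len(refs)}."
        )
    return refs
```

With the example from the pipeline page, this would be called as `images = check_reference_images(images)` just before `pipeline(images=images, ...)`, so that a miscounted list fails fast instead of after the model has been loaded onto the GPU.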