Replaced the dev environment with a task; improved configuration (uv, cache), single/multiple gpus supports; also added more details; added the example into the docs.

peterschmidt85 · peterschmidt85 · commit 8bebb07ac84e · 2025-08-27T12:28:32.000+02:00
diff --git a/docs/examples.md b/docs/examples.md
@@ -230,6 +230,22 @@ hide:
     </a>
 </div>
 
+## Models
+
+<div class="tx-landing__highlights_grid">
+    <a href="/examples/models/wan22"
+       class="feature-cell sky">
+        <h3>
+            Wan2.2
+        </h3>
+
+        <p>
+            Use Wan2.2 to generate videos from text
+        </p>
+    </a>
+</div>
+
+
 <!-- ## Misc
 
 <div class="tx-landing__highlights_grid">
diff --git a/docs/examples/models/wan22/index.md b/docs/examples/models/wan22/index.md
diff --git a/docs/overrides/main.html b/docs/overrides/main.html
@@ -122,6 +122,7 @@
                 <a href="/examples#distributed-training" class="tx-footer__section-link">Distributed training</a>
                 <a href="/examples#clusters" class="tx-footer__section-link">Clusters</a>
                 <a href="/examples#inference" class="tx-footer__section-link">Inference</a>
+                <a href="/examples#models" class="tx-footer__section-link">Models</a>
               </div>
               
               <div class="tx-footer__section">
diff --git a/examples/models/wan22/.dstack.yml b/examples/models/wan22/.dstack.yml
@@ -0,0 +1,63 @@
+type: task
+name: wan22
+
+repos:
+  # Clones it to `/workflow` (the default working directory)
+  - https://github.com/Wan-Video/Wan2.2.git
+
+python: 3.12
+nvcc: true
+
+env:
+  - PROMPT="Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage."
+  # Required for storing cache on a volume
+  - UV_LINK_MODE=copy
+commands:
+  # Install flash-attn
+  - |
+    uv pip install torch
+    uv pip install flash-attn --no-build-isolation 
+  # Install dependencies
+  - |
+    uv pip install . decord librosa
+    uv pip install "huggingface_hub[cli]"
+    hf download Wan-AI/Wan2.2-T2V-A14B --local-dir /root/.cache/Wan2.2-T2V-A14B
+  # Generate video
+  - |
+    if [ ${DSTACK_GPUS_NUM} -gt 1 ]; then
+      torchrun \
+        --nproc_per_node=${DSTACK_GPUS_NUM} \
+        generate.py \
+        --task t2v-A14B \
+        --size 1280*720 \
+        --ckpt_dir /root/.cache/Wan2.2-T2V-A14B \
+        --dit_fsdp --t5_fsdp --ulysses_size ${DSTACK_GPUS_NUM} \
+        --save_file ${DSTACK_RUN_NAME}.mp4 \
+        --prompt "${PROMPT}"
+    else
+      python generate.py \
+        --task t2v-A14B \
+        --size 1280*720 \
+        --ckpt_dir /root/.cache/Wan2.2-T2V-A14B \
+        --offload_model True \
+        --convert_model_dtype \
+        --save_file ${DSTACK_RUN_NAME}.mp4 \
+        --prompt "${PROMPT}"
+    fi
+  # Upload video
+  - curl https://bashupload.com/ -T ./${DSTACK_RUN_NAME}.mp4
+
+resources: 
+  gpu:
+    name: [H100, H200]
+    count: 1..8
+  disk: 300GB
+
+# Change to on-demand for disabling spot
+spot_policy: auto
+
+volumes:
+  # Cache pip packages and HF models
+  - instance_path: /root/dstack-cache
+    path: /root/.cache/
+    optional: true
diff --git a/examples/models/wan22/README.md b/examples/models/wan22/README.md
@@ -1,6 +1,6 @@
 # Wan2.2
 
-[Wan2.2](https://github.com/Wan-Video/Wan2.2) is an open-source SOTA foundational video model. This example shows how to run the T2V-A14B model variant via `dstack` for text-to-video generation.
+[Wan2.2 :material-arrow-top-right-thin:{ .external }](https://github.com/Wan-Video/Wan2.2){:target="_blank"} is an open-source SOTA foundational video model. This example shows how to run the T2V-A14B model variant via `dstack` for text-to-video generation.
 
 ??? info "Prerequisites"
     Once `dstack` is [installed](https://dstack.ai/docs/installation), clone the repo with examples.
@@ -14,28 +14,129 @@
  
     </div>
 
-Apply the [configuration](https://github.com/dstackai/dstack/blob/master/examples/models/wan22/dev-env.dstack.yml) to provision a GPU instance and run a dev environment with all the Wan2.2 dependencies installed:
+## Define a configuration
+
+Below is a task configuration that generates a video using Wan2.2, uploads it, and provides the download link.
+
+<div editor-title="examples/models/wan22/.dstack.yml"> 
+
+```yaml
+type: task
+name: wan22
+
+repos:
+  # Clones it to `/workflow` (the default working directory)
+  - https://github.com/Wan-Video/Wan2.2.git
+
+python: 3.12
+nvcc: true
+
+env:
+  - PROMPT="Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage."
+  # Required for storing cache on a volume
+  - UV_LINK_MODE=copy
+commands:
+  # Install flash-attn
+  - |
+    uv pip install torch
+    uv pip install flash-attn --no-build-isolation 
+  # Install dependencies
+  - |
+    uv pip install . decord librosa
+    uv pip install "huggingface_hub[cli]"
+    hf download Wan-AI/Wan2.2-T2V-A14B --local-dir /root/.cache/Wan2.2-T2V-A14B
+  # Generate video
+  - |
+    if [ ${DSTACK_GPUS_NUM} -gt 1 ]; then
+      torchrun \
+        --nproc_per_node=${DSTACK_GPUS_NUM} \
+        generate.py \
+        --task t2v-A14B \
+        --size 1280*720 \
+        --ckpt_dir /root/.cache/Wan2.2-T2V-A14B \
+        --dit_fsdp --t5_fsdp --ulysses_size ${DSTACK_GPUS_NUM} \
+        --save_file ${DSTACK_RUN_NAME}.mp4 \
+        --prompt "${PROMPT}"
+    else
+      python generate.py \
+        --task t2v-A14B \
+        --size 1280*720 \
+        --ckpt_dir /root/.cache/Wan2.2-T2V-A14B \
+        --offload_model True \
+        --convert_model_dtype \
+        --save_file ${DSTACK_RUN_NAME}.mp4 \
+        --prompt "${PROMPT}"
+    fi
+  # Upload video
+  - curl https://bashupload.com/ -T ./${DSTACK_RUN_NAME}.mp4
+
+resources: 
+  gpu:
+    name: [H100, H200]
+    count: 1..8
+  disk: 300GB
+
+# Change to on-demand for disabling spot
+spot_policy: auto
+
+volumes:
+  # Cache pip packages and HF models
+  - instance_path: /root/dstack-cache
+    path: /root/.cache/
+    optional: true
+```
+
+</div>
+
+You can customize the 
+
+## Run the configuration
+
+Once the configuration is ready, run `dstack apply -f <configuration file>`, and `dstack` will automatically provision the
+cloud resources and run the configuration.
 
 <div class="termy">
 
 ```shell
-$ dstack apply -f examples/models/wan22/dev-env.dstack.yml
-Provisioning...
+$ dstack apply -f examples/models/wan22/.dstack.yml
+
+ #  BACKEND              RESOURCES                                        INSTANCE TYPE   PRICE
+ 1  datacrunch (FIN-01)  cpu=30 mem=120GB disk=200GB H100:80GB:1 (spot)   1H100.80S.30V   $0.99
+ 2  datacrunch (FIN-01)  cpu=30 mem=120GB disk=200GB H100:80GB:1 (spot)   1H100.80S.30V   $0.99
+ 3  datacrunch (FIN-02)  cpu=44 mem=182GB disk=200GB H200:141GB:1 (spot)  1H200.141S.44V  $0.99
+
 ---> 100%
+
+Uploaded 1 file, 8 375 523 bytes
+
+wget https://bashupload.com/fIo7l/wan22.mp4
 ```
 
 </div>
 
-Then you can attach to the dev environment and generate videos:
+If you want you can override the default GPU, spot policy, and even the prompt via the CLI.
 
 <div class="termy">
 
 ```shell
-$ torchrun --nproc_per_node=8 generate.py --task t2v-A14B --size 1280*720 --ckpt_dir ./Wan2.2-T2V-A14B --dit_fsdp --t5_fsdp --ulysses_size 8 --prompt "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage."
+$ PROMPT=...
+$ dstack apply -f examples/models/wan22/.dstack.yml --spot --gpu H100,H200:8
 
-[2025-08-26 05:41:54,911] INFO: Input prompt: Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage.
-[2025-08-26 05:41:54,912] INFO: Creating WanT2V pipeline.
-[2025-08-26 05:42:50,296] INFO: loading ./Wan2.2-T2V-A14B/models_t5_umt5-xxl-enc-bf16.pth
+ #  BACKEND              RESOURCES                                          INSTANCE TYPE    PRICE
+ 1  aws (us-east-2)      cpu=192 mem=2048GB disk=300GB H100:80GB:8 (spot)   p5.48xlarge      $6.963
+ 2  datacrunch (FIN-02)  cpu=176 mem=1480GB disk=300GB H100:80GB:8 (spot)   8H100.80S.176V   $7.93
+ 3  datacrunch (ICE-01)  cpu=176 mem=1450GB disk=300GB H200:141GB:8 (spot)  8H200.141S.176V  $7.96
+ 
+---> 100%
+
+Uploaded 1 file, 8 375 523 bytes
+
+wget https://bashupload.com/fIo7l/wan22.mp4
 ```
 
 </div>
+
+## Source code
+
+The source-code of this example can be found in
+[`examples/models/wan22` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/models/wan22){:target="_blank"}.
diff --git a/examples/models/wan22/dev-env.dstack.yml b/examples/models/wan22/dev-env.dstack.yml