Thanks for releasing FantasyWorld — this unified video + 3D world modeling framework looks very promising! I am planning to run inference with your provided code and have a few questions about hardware requirements.
1. VRAM requirement
- What is the approximate GPU memory (VRAM) needed to run a single inference pass with the default model (e.g., frozen Wan2.1-based backbones + geometry branch), especially for a typical case like ~80 frames at 480p?
- Do you have any measurements of peak GPU memory usage during inference?
2. Recommended inference GPU configuration
- What would you consider the best recommended GPU setup for stable inference with FantasyWorld?
- For example, is a single high-memory card (e.g., 48 GB+) sufficient, or do you recommend multi-GPU (e.g., 4×/8× 24 GB GPUs)?
- Are there preferred strategies such as model sharding / tensor parallelism / DeepSpeed inference to distribute across GPUs?
Thank you very much for your time and help!
Thanks for releasing FantasyWorld — this unified video + 3D world modeling framework looks very promising! I am planning to run inference with your provided code and have a few questions about hardware requirements.
1. VRAM requirement
2. Recommended inference GPU configuration
Thank you very much for your time and help!