[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models
-
Updated
Oct 10, 2025 - Python
[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models
Unofficial implementation of VTok (https://arxiv.org/pdf/2602.04202)
Code and dataset for evaluating Multimodal LLMs on indexical, iconic, and symbolic gestures (Nishida et al., ACL 2025)
Add a description, image, and links to the llava-next-video topic page so that developers can more easily learn about it.
To associate your repository with the llava-next-video topic, visit your repo's landing page and select "manage topics."