Official PyTorch implementation of GOAL, a frame-supervised VMR framework for elastic boundary localization via a game-based paradigm and Dynamic Updating Technique (DUT).
Hao Liu1, Yupeng Hu1*, Kun Wang1, Yinwei Wei1, Liqiang Nie2
1 School of Software, Shandong University, Jinan, China
2 School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, China
* Corresponding author
- Paper (SIGIR 2025): https://doi.org/10.1145/3726302.3729984
- Updates
- Introduction
- Highlights
- Method Overview
- Repository Structure
- Installation
- Dataset Preparation
- Checkpoints
- Training
- Evaluation
- Results
- Citation
- Acknowledgement
- License
- Contact
- [04/2026] Initial public code release.
This repository contains the implementation of GOAL: Gaming fOr elAstic Localization for Frame-Supervised Video Moment Retrieval (SIGIR 2025).
Frame-supervised Video Moment Retrieval (VMR) aims to retrieve the temporal moment in a video that matches a natural language query using a single annotated frame. GOAL addresses ambiguous boundary retrieval with a game-based paradigm (KFP/AFP/BP) and Dynamic Updating Technique (DUT), improving boundary decisions through unilateral and bilateral updates.
This repository currently provides:
- training code for frame-supervised VMR
- evaluation code for benchmark datasets
- checkpoint files for quick reproduction
- utilities for dataset preparation and experiment management
- Supports three standard datasets:
activitynetcaptions,charadessta,tacos. - Includes training and evaluation scripts.
- Includes checkpoint folders for quick reproduction:
Cha_ckpt/(I3D / VGG)TACoS_ckpt/
|- assets/
| |- pipeline.png
|- src/
| |- config.yaml
| |- dataset/
| | |- dataset.py
| | |- generate_glance.py
| | |- generate_duration_glance.py
| |- experiment/
| | |- train.py
| | |- eval.py
| |- model/
| | |- model.py
| | |- building_blocks.py
| |- utils/
| | |- utils.py
| | |- vl_utils.py
|- README.md
git clone https://github.com/iLearn-Lab/SIGIR25-GOAL.git
cd GOALpython -m venv .venv
source .venv/bin/activate # Linux / Mac
# .venv\Scripts\activate # Windowspip install numpy scipy pyyaml tqdmThis repository follows ViGA's dataset preparation protocol for:
- ActivityNet Captions
- Charades-STA
- TACoS
This repo currently contains local paths (for example under data/...), mainly in:
src/config.yamlsrc/utils/utils.py
Before running, replace them with your local dataset root and feature paths.
The cloud links of checkpoints: Google Drive and Hugging Face.
ActivityNet Captions:
python -m src.experiment.train --task activitynetcaptionsCharades-STA:
python -m src.experiment.train --task charadesstaTACoS:
python -m src.experiment.train --task tacosEvaluate a trained experiment folder:
python -m src.experiment.eval --exp path/to/your/experiment_folderThe folder should contain:
config.yamlmodel_best.pt
| Method | Venue | R@0.3 | R@0.5 | R@0.7 | mIoU |
|---|---|---|---|---|---|
| ViGA | SIGIR 2022 | 71.21 | 45.05 | 20.27 | 44.57 |
| CFMR | MM 2023 | - | 48.14 | 22.58 | - |
| SG-SCI | MM 2024 | 70.30 | 52.07 | 27.23 | 46.77 |
| GOAL (Ours) | SIGIR 2025 | 71.42 | 55.56 | 27.69 | 47.70 |
| Method | Venue | R@0.3 | R@0.5 | R@0.7 | mIoU |
|---|---|---|---|---|---|
| ViGA | SIGIR 2022 | 60.22 | 36.72 | 17.20 | 38.62 |
| D3G | ICCV 2023 | - | 41.64 | 19.60 | - |
| GOAL (Ours) | SIGIR 2025 | 61.16 | 42.69 | 21.53 | 40.56 |
| Method | Venue | R@0.3 | R@0.5 | R@0.7 | mIoU |
|---|---|---|---|---|---|
| ViGA | SIGIR 2022 | 56.85 | 35.11 | 15.11 | 36.35 |
| GOAL (Ours) | SIGIR 2025 | 56.96 | 37.74 | 17.02 | 36.74 |
| Method | Venue | R@0.3 | R@0.5 | R@0.7 | mIoU |
|---|---|---|---|---|---|
| ViGA | SIGIR 2022 | 59.61 | 35.79 | 16.96 | 40.12 |
| CFMR | MM 2023 | - | 36.97 | 17.33 | - |
| D3G | ICCV 2023 | 58.25 | 36.68 | 18.54 | - |
| GOAL (Ours) | SIGIR 2025 | 59.19 | 38.72 | 18.57 | 40.22 |
@inproceedings{liu2025gaming,
title={Gaming for Boundary: Elastic Localization for Frame-Supervised Video Moment Retrieval},
author={Liu, Hao and Hu, Yupeng and Wang, Kun and Wei, Yinwei and Nie, Liqiang},
booktitle={Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval},
year={2025},
doi={10.1145/3726302.3729984}
}- The dataset preparation and organization in this repository follow ViGA.
- Thanks to all collaborators and contributors of this project.
This project is released under the Apache License 2.0.
If you have any questions, feel free to contact me at liuh90210@gmail.com.
