Junfeng Ni1,2, Song-Chun Zhu1,2,3, Siyuan Huang2
1Tsinghua University 2National Key Lab of General AI, BIGAI 3Peking University
We provide a script, install.sh, to set up the environment. In our experiments, we used NVIDIA CUDA 12.4 on Ubuntu 22.04; you may need to modify the installation commands to match your CUDA version.
For the VideoArtGS-20 dataset, we provide the data here.
For the Video2Articulation dataset, please download the data from Video2Articulation and the PartNet-Mobility dataset, then preprocess it with python data_tools/process_v2a.py. Alternatively, you can download the processed version here.
Data structure:
data
├── videoartgs
│   ├── realscan
│   │   ├── microwave
│   │   │   ├── images
│   │   │   ├── ...
│   ├── sapien
│   │   ├── 100481
│   │   │   ├── images
│   │   │   ├── ...
├── v2a
│   ├── sapien
│   │   ├── 100068_joint_0_bg_view_0
│   │   │   ├── images
│   │   │   ├── ...
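After downloading, a small sanity check like the following can verify that each scene folder contains an images directory. This is an illustrative sketch assuming the three-level layout shown above (dataset/split/scene); adjust the root if you downloaded only a subset:

```python
from pathlib import Path

def check_layout(data_root="data"):
    """Return scene folders under data_root missing an images/ subfolder."""
    missing = []
    # Matches paths like data/videoartgs/sapien/100481
    for scene in Path(data_root).glob("*/*/*"):
        if scene.is_dir() and not (scene / "images").is_dir():
            missing.append(str(scene))
    return missing
```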
We provide the following files and scripts for training:
init_cano.py & scripts/init_cano.sh: training the coarse single-state Gaussians.
init_deform.py & scripts/init_deform.sh: training the deformable Gaussians.
train.py & scripts/train.sh: training the full model.
train_gui.py: training the full model with GUI visualization.
Please run scripts/init_cano.sh and scripts/init_deform.sh before running scripts/train.sh.
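The three-stage ordering can be scripted as below. This is a minimal sketch; the stage scripts are the ones listed above, but any per-scene arguments they expect are not shown here:

```python
import subprocess

# Three-stage training pipeline: coarse canonical Gaussians first,
# then deformable Gaussians, then the full model.
STAGES = ["scripts/init_cano.sh", "scripts/init_deform.sh", "scripts/train.sh"]

def run_pipeline(stages=STAGES, runner=subprocess.run):
    """Run each stage in order; check=True aborts on the first failure."""
    for script in stages:
        runner(["bash", script], check=True)
```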
We provide render.py and the scripts scripts/render.sh and scripts/eval.sh for evaluation. You can download the checkpoints from here and put them in the outputs folder.
We provide visualization tools for intermediate results in the vis_utils folder.
You can visualize the point cloud, joint, and centers for initialization in vis_utils/vis_init.ipynb, and visualize the Gaussians and deformation models in vis_utils/vis_videoartgs.ipynb.
We provide vis_utils/json2urdf.py to export URDF files from the trained model. Load the URDF files with IsaacSim (>=4.5) to export USD files. We found that IsaacSim cannot load textures of .ply meshes, so we provide a script, vis_utils/ply2glb.py, which uses Blender to convert the .ply meshes to .glb meshes.
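For reference, the core of a JSON-to-URDF export can be sketched with the standard library alone. The joint-dict schema below (name, type, parent/child links, origin, axis, limits) is a hypothetical assumption for illustration; the actual format consumed by vis_utils/json2urdf.py may differ:

```python
import xml.etree.ElementTree as ET

def joints_to_urdf(robot_name, joints):
    """Build a minimal URDF string from a list of joint dicts.

    Each dict is assumed to hold: name, type ("revolute"/"prismatic"),
    parent/child link names, origin xyz, axis xyz, and a (lower, upper) limit.
    Mesh geometry and inertial tags are omitted for brevity.
    """
    robot = ET.Element("robot", name=robot_name)
    links = set()
    for j in joints:
        links.update([j["parent"], j["child"]])
    for link in sorted(links):
        ET.SubElement(robot, "link", name=link)
    for j in joints:
        joint = ET.SubElement(robot, "joint", name=j["name"], type=j["type"])
        ET.SubElement(joint, "parent", link=j["parent"])
        ET.SubElement(joint, "child", link=j["child"])
        ET.SubElement(joint, "origin", xyz=" ".join(map(str, j["origin"])))
        ET.SubElement(joint, "axis", xyz=" ".join(map(str, j["axis"])))
        ET.SubElement(joint, "limit",
                      lower=str(j["limit"][0]), upper=str(j["limit"][1]))
    return ET.tostring(robot, encoding="unicode")
```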
See detailed instructions in preprocess.md.
If you find our paper and/or code helpful, please consider citing:
@article{liu2025videoartgs,
title={VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video},
author={Liu, Yu and Jia, Baoxiong and Lu, Ruijie and Gan, Chuyue and Chen, Huayu and Ni, Junfeng and Zhu, Song-Chun and Huang, Siyuan},
journal={arXiv preprint arXiv:2509.17647},
year={2025}
}
This code builds heavily on resources from ArtGS, SpatialTrackerV2, TAPIP3D, and Video2Articulation. We thank the authors for open-sourcing their awesome projects.
