S3L

Leveraging Spatiotemporal Cues for Self-Supervised Stereo Depth Estimation in Endoscopic Videos

We have released the segmentation configurations and loading scripts for the SCARED dataset in advance.

📝 Checklist

⬛ Upload data processing code
⬛ Upload inference code
⬛ Upload pre-trained weights
⬛ Upload training code

🛠 Installation

First, clone the repository with the --recursive flag so that its submodules are fetched as well.

git clone --recursive https://github.com/Intelligent-Imaging-Center/S3L.git
cd S3L

Create a new conda environment.

conda create --name s3l python=3.9
conda activate s3l

Install CUDA 11.8 and the torch-related packages.

pip install numpy==1.25.0 # do not use numpy >= v2.0.0
conda install --channel "nvidia/label/cuda-11.8.0" cuda-toolkit
pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu118

Now install the other requirements

conda env update -f environment.yml --prune

Basic Dependencies:

CUDA >= 11.8
Python >= 3.8
PyTorch >= 2.0.0
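After installation, a quick sanity check can confirm that the environment matches the version bounds listed above. The following is a minimal sketch, not part of the repository: the helper names (`parse_version`, `installed_version`, `check_environment`) are hypothetical, and it only inspects Python, torch, and numpy. It does not verify the CUDA toolkit version.

```python
import sys
from importlib import metadata

# Bounds taken from the dependency list and install notes above.
MIN_PYTHON = (3, 8)
MIN_TORCH = (2, 0, 0)
NUMPY_UPPER = (2, 0, 0)  # install notes say: do not use numpy >= v2.0.0


def parse_version(text):
    """Turn '2.1.0+cu118' into (2, 1, 0); non-numeric suffixes are ignored."""
    core = text.split("+")[0]
    parts = []
    for piece in core.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def installed_version(package):
    """Return the installed version tuple, or None if the package is missing."""
    try:
        return parse_version(metadata.version(package))
    except metadata.PackageNotFoundError:
        return None


def check_environment():
    """Collect human-readable problems; an empty list means the checks passed."""
    problems = []
    if sys.version_info[:2] < MIN_PYTHON:
        problems.append("Python is older than 3.8")
    torch_v = installed_version("torch")
    if torch_v is None:
        problems.append("torch is not installed")
    elif torch_v < MIN_TORCH:
        problems.append("torch is older than 2.0.0")
    numpy_v = installed_version("numpy")
    if numpy_v is None:
        problems.append("numpy is not installed")
    elif numpy_v >= NUMPY_UPPER:
        problems.append("numpy is 2.0.0 or newer")
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["environment looks consistent"]:
        print(line)
```

Run it inside the activated s3l environment; any line other than "environment looks consistent" points at a package to reinstall.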

📁 Dataset Preparation

We excluded subsets D4 and D5 of the SCARED dataset due to severe calibration inaccuracies and temporal misalignment. The remaining subsets were split as follows: D1–D3 and D6–D7 were used for training, yielding 19 videos with a total of 17,206 frames. D8 and D9 were reserved for testing, providing 8 videos with 5,907 frames.
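The split described above can be written down as a small lookup. This is an illustrative sketch only: the constant and function names are hypothetical, and the `dataset_N` folder naming is assumed from the folder structure shown in this README. The repository's own split files remain the authoritative definition.

```python
# Subset IDs follow the split described above (D4 and D5 are excluded).
TRAIN_SUBSETS = (1, 2, 3, 6, 7)
TEST_SUBSETS = (8, 9)
EXCLUDED_SUBSETS = (4, 5)


def split_of(folder_name):
    """Map a folder such as 'dataset_8' to 'train', 'test', or 'excluded'."""
    subset_id = int(folder_name.rsplit("_", 1)[1])
    if subset_id in TRAIN_SUBSETS:
        return "train"
    if subset_id in TEST_SUBSETS:
        return "test"
    if subset_id in EXCLUDED_SUBSETS:
        return "excluded"
    raise ValueError(f"unknown SCARED subset: {folder_name}")


if __name__ == "__main__":
    for i in range(1, 10):
        print(f"dataset_{i}: {split_of(f'dataset_{i}')}")
```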

Please refer to this to prepare your SCARED data.

The folder structure is as follows:
scard/
├── dataset_1/
│   ├── keyframe_1/
│   │   ├── disp/
│   │   ├── left_finalpass/
│   │   └── right_finalpass/
│   ├── keyframe_2/
│   ├── keyframe_3/
│   └── keyframe_4/
├── dataset_2/
│   ├── keyframe_1/
│   ├── keyframe_2/
│   └── keyframe_3/
...
└── dataset_9/
    ├── keyframe_1/
    ├── keyframe_2/
    └── keyframe_3/
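Before training, it can help to confirm that the prepared data matches this layout. The sketch below is not part of the repository; `find_incomplete_keyframes` is a hypothetical helper. It assumes that every keyframe folder contains the same three sub-folders shown under keyframe_1 in the tree, which the tree itself does not confirm for the other keyframes.

```python
from pathlib import Path

# Sub-folders shown under keyframe_1 in the tree above; assumed (not confirmed)
# to be present under every keyframe.
EXPECTED_SUBDIRS = ("disp", "left_finalpass", "right_finalpass")


def find_incomplete_keyframes(root):
    """Return relative paths of expected sub-folders that are missing."""
    root = Path(root)
    missing = []
    for dataset_dir in sorted(root.glob("dataset_*")):
        for keyframe_dir in sorted(dataset_dir.glob("keyframe_*")):
            for name in EXPECTED_SUBDIRS:
                candidate = keyframe_dir / name
                if not candidate.is_dir():
                    missing.append(candidate.relative_to(root).as_posix())
    return missing


if __name__ == "__main__":
    import sys

    # Usage: python check_layout.py path/to/your/data
    for path in find_incomplete_keyframes(sys.argv[1]):
        print("missing:", path)
```

An empty result means every keyframe folder found under the root has all three sub-folders; it does not check the image files themselves.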

✨ Quick Test:

Prepare the trained model: we will release the pre-trained model soon.

Test a single image:

python eval_img.py --load_weights path/to/your/weights/folder --image_path path/to/your/test/image

Test a video:

python eval_vedio.py --load_weights path/to/your/weights/folder --vedio_path path/to/your/test/vedio
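To run the single-image test over a whole folder, the command can be wrapped in a small driver. This is a sketch under assumptions: it uses only the `--load_weights` and `--image_path` flags shown above, assumes `eval_img.py` processes one image per invocation, and the list of image suffixes and the helper names are hypothetical.

```python
import subprocess
import sys
from pathlib import Path

# Assumed input formats; adjust to whatever eval_img.py actually accepts.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def build_eval_commands(weights_dir, image_dir):
    """Build one eval_img.py command per image found directly in image_dir."""
    commands = []
    for image in sorted(Path(image_dir).iterdir()):
        if image.suffix.lower() in IMAGE_SUFFIXES:
            commands.append([
                sys.executable, "eval_img.py",
                "--load_weights", str(weights_dir),
                "--image_path", str(image),
            ])
    return commands


def run_all(commands):
    """Run the commands one after another; stops at the first failure."""
    for command in commands:
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Usage (from the repository root):
    #   python batch_eval.py path/to/your/weights/folder path/to/your/test/images
    run_all(build_eval_commands(sys.argv[1], sys.argv[2]))
```

Building the command list separately from running it makes it easy to print and inspect the commands before launching them.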

🖋 Train:

python train.py --data_path path/to/your/data --output_name mytrain --config configs/scared/d1/k1.yaml
