SegDepthFusion

A high-performance, real-time segmentation and depth estimation pipeline designed for high-end NVIDIA GPUs. This project seamlessly integrates SAM 3 (Segment Anything Model 3) and Depth Anything V3 to provide instance-aware depth maps.

Features

Real-time Processing: Optimized for high-throughput inference.
Instance-Aware Depth: Combines semantic masks from SAM 3 with metric depth from Depth Anything 3.
Automated Setup: One-click environment configuration for complex dependencies (PyTorch + Local Packages).
Screen Capture Integration: Built-in support for real-time screen inference.
Cross-Platform: Supports both Windows 11 and Ubuntu/Linux.

Installation

Requirements: NVIDIA GPU with CUDA 12.x support. Supports Windows 11 and Ubuntu 20.04+.

1. Clone the Repository

Ensure you clone with submodules to get the core model architectures:

git clone --recursive https://github.com/Lloyd-lei/SegDepthFusion.git
cd SegDepthFusion

2. One-Click Setup

Linux/Ubuntu

Run the setup script in your terminal:

bash auto_setup.sh

This script will:

Create a Conda environment named seg_depth_auto (Python 3.11).
Install project dependencies (xformers, triton, etc.).
Install local sam3 and Depth-Anything-3 modules in editable mode.

Note: The script assumes PyTorch and CUDA are already installed on your system. If not, install them first:
conda activate seg_depth_auto
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128

Windows (PowerShell)

Run this command in PowerShell:

powershell -ExecutionPolicy Bypass -File auto_setup.ps1

This script will:

Create a Conda environment named seg_depth_auto (Python 3.11).
Install PyTorch, TorchVision, and xformers compatible with your GPU.
Install general dependencies (numpy, cv2, etc.).
Compile and install local sam3 and Depth-Anything-3 modules.

3. Activate Environment

conda activate seg_depth_auto

Usage

Quick Test

Verify your installation and model loading with the included test script:

python quick_test.py

Run Pipeline on Images

Process a folder of images to generate segmentation + depth visualizations:

python test_pipline.py --folder orange_photos

Results will be saved to the outputs/ directory.

Real-time Main Loop

To run the main real-time processing loop (configurable via config.yaml):

python main.py

Directory Structure

seg_depth/
├── auto_setup.sh           # Linux installation script
├── auto_setup.ps1          # Windows installation script
├── requirements.txt        # Python dependencies
├── config.yaml             # Pipeline configuration
├── main.py                 # Real-time application entry point
├── test_pipline.py         # Batch image processing script
├── quick_test.py           # Installation verification
├── sam3_model.py           # SAM 3 Model Wrapper
├── da3_model.py            # Depth Anything 3 Model Wrapper
├── seg_depth_pipeline.py   # Core logic combining Seg + Depth
├── orange_photos/          # Test images directory
├── Depth-Anything-3/       # [Submodule] Depth Anything V3 source
└── sam3/                   # [Submodule] SAM 3 source

Troubleshooting

gsplat Warning: You may see a warning about gsplat missing. This is optional for 3D rendering and does not affect the core pipeline.
triton Warning: On Windows, xformers may warn about missing triton. This is normal (Linux-only feature) and safe to ignore. On Linux, triton is installed automatically for better performance.

License

This project is based on SAM 3 and Depth Anything. Please refer to their respective repositories for license details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SegDepthFusion

Features

Installation

1. Clone the Repository

2. One-Click Setup

Linux/Ubuntu

Windows (PowerShell)

3. Activate Environment

Usage

Quick Test

Run Pipeline on Images

Real-time Main Loop

Directory Structure

Troubleshooting

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Depth-Anything-3		Depth-Anything-3
orange_photos		orange_photos
outputs		outputs
sam3		sam3
.gitignore		.gitignore
README.md		README.md
auto_setup.ps1		auto_setup.ps1
auto_setup.sh		auto_setup.sh
config.yaml		config.yaml
da3_model.py		da3_model.py
deploy.py		deploy.py
inspect_models.py		inspect_models.py
install.md		install.md
main.py		main.py
quick_test.py		quick_test.py
requirements.txt		requirements.txt
restore_env.ps1		restore_env.ps1
sam3_model.py		sam3_model.py
screen_capture.py		screen_capture.py
seg_depth_pipeline.py		seg_depth_pipeline.py
setup_clean_env.ps1		setup_clean_env.ps1
test_pipline.py		test_pipline.py
verify_env.py		verify_env.py
visualizer.py		visualizer.py

Folders and files

Latest commit

History

Repository files navigation

SegDepthFusion

Features

Installation

1. Clone the Repository

2. One-Click Setup

Linux/Ubuntu

Windows (PowerShell)

3. Activate Environment

Usage

Quick Test

Run Pipeline on Images

Real-time Main Loop

Directory Structure

Troubleshooting

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages