Advanced Stem Separator

An audio stem separation tool built with Python. It uses the Demucs deep learning model to split a track into four stems: drums, bass, vocals, and other instruments.

Features

High-quality audio stem separation
User-friendly graphical interface
Real-time progress tracking
Support for MP3, WAV, and FLAC files
Enhanced vocal processing
Memory-efficient chunked processing
CUDA GPU acceleration support

Requirements

Python 3.8 or higher (the pinned CUDA build, PyTorch 2.1.0, ships wheels for Python 3.8 through 3.11)
NVIDIA GPU (optional, for faster processing)
FFmpeg installed on your system
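The requirements above can be checked before installation with a short script. This is a minimal sketch, not part of the repository: the function name check_environment and the report keys are hypothetical, and it assumes FFmpeg is invoked as ffmpeg on your PATH.

```python
import importlib.util
import shutil
import sys


def check_environment(min_python=(3, 8)):
    """Return a dict describing whether each documented requirement is met."""
    report = {
        # The README asks for Python 3.8 or higher.
        "python_ok": sys.version_info >= min_python,
        # Full path to the ffmpeg executable, or None if it is not on PATH.
        "ffmpeg_path": shutil.which("ffmpeg"),
        # PyTorch may not be installed yet; look it up without importing it.
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": False,
    }
    if report["torch_installed"]:
        try:
            import torch

            report["cuda_available"] = bool(torch.cuda.is_available())
        except Exception:
            # A broken PyTorch install is treated the same as "no GPU".
            report["cuda_available"] = False
    return report


if __name__ == "__main__":
    for name, value in check_environment().items():
        print(f"{name}: {value}")
```

A missing GPU is not an error here, since the NVIDIA GPU is optional.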

Installation

Platform-Specific Setup

This application requires different PyTorch installations depending on your platform:

Windows (with NVIDIA GPU support)

Clone or download this repository
Run setup_windows_cuda.bat for automatic GPU-enabled setup
Or manually install CUDA-enabled PyTorch:

```
# Create virtual environment
python -m venv venv_windows
venv_windows\Scripts\activate

# Install CUDA-enabled PyTorch
pip install torch==2.1.0+cu118 torchvision==0.16.0+cu118 torchaudio==2.1.0+cu118 --index-url https://download.pytorch.org/whl/cu118

# Install other dependencies
pip install -r requirements.txt
```

macOS (CPU-only)

Clone or download this repository
Run ./setup_macos.sh for automatic setup
Or manually install CPU-only PyTorch:

```
# Install Python 3.12 (required for PyTorch compatibility)
brew install python@3.12

# Create virtual environment
# (Homebrew's prefix is /usr/local on Intel Macs and /opt/homebrew on Apple Silicon)
"$(brew --prefix)/bin/python3.12" -m venv venv_macos
source venv_macos/bin/activate

# Install CPU-only PyTorch
pip install torch torchvision torchaudio

# Install other dependencies
pip install -r requirements.txt
```

Legacy Setup (Not Recommended)

The old setup.bat and run.bat files may not work with newer Python versions.

Usage

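Start the program:
- Double-click run.bat (Windows; a legacy script that may fail on newer Python versions, see Legacy Setup)
- Or activate the virtual environment created during installation and run: python main.py

Using the application: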
- Click "Upload Audio File" to select your audio file
- Choose an output directory
- Click "Process and Separate Stems"
- Wait for processing to complete
- Find your separated stems in the output directory
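The two inputs in the steps above (an audio file in a supported format and an output directory) can be validated before any processing starts. The following is a minimal sketch under assumptions: validate_job and SUPPORTED_EXTENSIONS are hypothetical names, and the application's own checks may differ.

```python
from pathlib import Path

# Formats listed under Features: MP3, WAV, and FLAC.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".flac"}


def validate_job(audio_file, output_dir):
    """Return a list of human-readable problems; an empty list means ready."""
    problems = []
    audio_path = Path(audio_file)
    if audio_path.suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append(f"Unsupported file type: {audio_path.suffix or '(none)'}")
    elif not audio_path.is_file():
        problems.append(f"Audio file not found: {audio_path}")
    if not Path(output_dir).is_dir():
        problems.append(f"Output directory does not exist: {output_dir}")
    return problems
```

Returning a list instead of raising on the first problem lets a GUI show every issue at once.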

Output Stems

The program generates four separate audio files:

drums.wav: Drum tracks
bass.wav: Bass tracks
vocals.wav: Vocal tracks
other.wav: Other instruments
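Demucs also ships a command-line interface that writes the same four stems. The sketch below only builds that command and the paths the CLI would write to. It is not necessarily how this application invokes Demucs. The helper names are hypothetical; -n, -o, -d, and --float32 are options of the Demucs CLI, and htdemucs_ft is the model named under Technical Details. The Demucs CLI nests its results under a model and track folder, which may differ from this application's output layout.

```python
import sys
from pathlib import Path

STEMS = ("drums", "bass", "vocals", "other")


def build_demucs_command(audio_file, output_dir, model="htdemucs_ft", device=None):
    """Build an argument list that runs the Demucs CLI for one file."""
    cmd = [sys.executable, "-m", "demucs", "-n", model, "--float32", "-o", str(output_dir)]
    if device:
        # For example "cuda" or "cpu".
        cmd += ["-d", device]
    cmd.append(str(audio_file))
    return cmd


def expected_stem_paths(audio_file, output_dir, model="htdemucs_ft"):
    """Map each stem name to the file the Demucs CLI would write."""
    track_dir = Path(output_dir) / model / Path(audio_file).stem
    return {stem: track_dir / f"{stem}.wav" for stem in STEMS}
```

The command can be passed to subprocess.run(cmd, check=True) once Demucs is installed in the active environment.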

Troubleshooting

Memory Issues
- The program automatically handles large files through chunk processing
- For very large files, ensure you have at least 8 GB of RAM

CUDA/GPU Issues

Windows (NVIDIA GPU):
- Make sure you have NVIDIA GPU drivers installed
- IMPORTANT: Install CUDA-enabled PyTorch for GPU acceleration:
```
# Uninstall CPU-only PyTorch first
pip uninstall torch torchvision torchaudio -y

# Install CUDA-enabled PyTorch
pip install torch==2.1.0+cu118 torchvision==0.16.0+cu118 torchaudio==2.1.0+cu118 --index-url https://download.pytorch.org/whl/cu118
```
- Or run setup_windows_cuda.bat for automatic CUDA setup
- The program will automatically fall back to CPU if GPU is unavailable
- Check GPU detection in the application log output

macOS:
- CUDA is not supported on macOS - use CPU-only PyTorch
- Run ./setup_macos.sh for automatic setup
- The application will automatically use CPU processing

Audio Quality
- For best results, use high-quality input files (WAV/FLAC)
- Output files are saved as 32-bit floating-point (float32) WAV
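The CPU fallback described under CUDA/GPU Issues can be sketched as follows. This is an illustration rather than the application's actual code: pick_device is a hypothetical name, while torch.cuda.is_available() is the PyTorch call that reports whether a usable CUDA device was found.

```python
import logging


def pick_device(prefer_gpu=True):
    """Return "cuda" when a usable NVIDIA GPU is detected, otherwise "cpu"."""
    if not prefer_gpu:
        return "cpu"
    try:
        import torch
    except ImportError:
        # Without PyTorch there is nothing to accelerate; report CPU.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


logging.basicConfig(level=logging.INFO)
device = pick_device()
logging.info("Using device: %s", device)
```

If this logs cpu on a Windows machine with an NVIDIA GPU, the installed PyTorch is most likely the CPU-only build.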

Technical Details

Uses the fine-tuned Hybrid Transformer Demucs model (htdemucs_ft) for separation
Implements chunked processing for memory efficiency
Features crossfade processing for seamless stem combination
Enhanced vocal processing with stereo field preservation
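Chunked processing with crossfading can be illustrated in plain Python. The sketch assumes the crossfade is applied where adjacent chunks overlap. The chunk length, overlap, and linear fade shape are illustrative choices, and process_in_chunks is a hypothetical name, not the application's API. The process callback stands in for the separation model and must return as many samples as it receives.

```python
from typing import Callable, List, Sequence


def process_in_chunks(
    samples: Sequence[float],
    process: Callable[[List[float]], List[float]],
    chunk_size: int,
    overlap: int,
) -> List[float]:
    """Run `process` on overlapping chunks and crossfade the overlaps."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    total = len(samples)
    hop = chunk_size - overlap
    mixed = [0.0] * total      # weighted sum of processed chunks
    weights = [0.0] * total    # sum of fade weights at each sample
    start = 0
    while start < total:
        chunk = list(samples[start:start + chunk_size])
        result = process(chunk)
        length = len(chunk)
        is_first = start == 0
        is_last = start + length >= total
        for i in range(length):
            weight = 1.0
            if not is_first and i < overlap:
                # Linear fade-in over the region shared with the previous chunk.
                weight = (i + 1) / (overlap + 1)
            if not is_last and i >= length - overlap:
                # Linear fade-out over the region shared with the next chunk.
                weight = min(weight, (length - i) / (overlap + 1))
            mixed[start + i] += weight * result[i]
            weights[start + i] += weight
        if is_last:
            break
        start += hop
    # Normalizing by the summed weights keeps the overall gain at 1.
    return [value / weight for value, weight in zip(mixed, weights)]


signal = [float(i) for i in range(50)]
halved = process_in_chunks(
    signal, lambda chunk: [0.5 * x for x in chunk], chunk_size=16, overlap=4
)
```

Only one chunk is handed to the callback at a time, which is what keeps peak memory bounded for long files.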

License

This project is licensed under the MIT License - see the LICENSE file for details.

The MIT License is a short, permissive license. You may use, modify, and redistribute the code, provided you keep the copyright and license notice. The software is provided without warranty.

Credits

Demucs by Meta Research (Licensed under MIT)
GUI built with PyQt5 (Licensed under GPL v3 or a commercial license)
Audio processing using librosa and soundfile

Contributing

Feel free to submit issues and enhancement requests!
