How to Download the GTZAN Dataset

This project requires the GTZAN Genre Collection dataset. You can download it automatically using the provided Python script (recommended for TSOAI courses) or manually.

Option 1: Automatic Download (Recommended)

This method uses the dataset_downloader.py script to fetch the data directly from Hugging Face and organize the folders according to the project requirements. No API keys are required.

1. Install Requirements

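The script relies almost entirely on the Python standard library, so usually no extra installation is needed. The requests package is not part of the standard library; if it is missing from your environment, install it: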

pip install requests

2. Run the Script

From your project root, run the script to download the archive, extract it, and delete the compressed file automatically:

python dataset_downloader.py

What this script does

Download: Fetches the dataset archive (genres.tar.gz) directly from the marsyas/gtzan Hugging Face repository.
Create directory: Creates a ./GTZAN_dataset folder in your project root.
Extract: Unpacks the archive contents directly into that folder.
Cleanup: Removes the .tar.gz file after extraction to save space.
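
For readers who want to see roughly what these four steps involve, here is a minimal sketch. It is not the actual dataset_downloader.py: the function names are hypothetical, and ARCHIVE_URL is an assumed direct-download link derived from the repository page linked in this guide, so it may differ from what the script uses.

```python
import os
import tarfile

# Assumed direct-download URL (not confirmed by this guide); the real
# dataset_downloader.py may use a different one.
ARCHIVE_URL = "https://huggingface.co/datasets/marsyas/gtzan/resolve/main/data/genres.tar.gz"
TARGET_DIR = "GTZAN_dataset"


def download_archive(url, archive_path, chunk_size=1 << 20):
    """Stream the archive to disk so the whole file is never held in memory."""
    import requests  # third-party; imported lazily so extraction works without it

    with requests.get(url, stream=True, timeout=60) as response:
        response.raise_for_status()
        with open(archive_path, "wb") as out:
            for chunk in response.iter_content(chunk_size=chunk_size):
                out.write(chunk)


def extract_and_cleanup(archive_path, target_dir):
    """Unpack a .tar.gz into target_dir, then delete the archive."""
    os.makedirs(target_dir, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as archive:
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions can reject unsafe member paths.
            archive.extractall(target_dir, filter="data")
        else:
            archive.extractall(target_dir)
    os.remove(archive_path)


def fetch_gtzan(url=ARCHIVE_URL, target_dir=TARGET_DIR):
    """Create the folder, download, extract, and clean up, in that order."""
    os.makedirs(target_dir, exist_ok=True)
    archive_path = os.path.join(target_dir, "genres.tar.gz")
    download_archive(url, archive_path)
    extract_and_cleanup(archive_path, target_dir)
```

Calling fetch_gtzan() from the project root would perform all four steps. Streaming the download in chunks is a design choice of this sketch, made so that a large archive is never held in memory at once.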

Option 2: Manual Download

If the script fails or you prefer to manage files yourself:

1. Download

Visit the Hugging Face file repository: https://huggingface.co/datasets/marsyas/gtzan/tree/main/data
Download the file named genres.tar.gz.

2. Extract and Organize

Create a folder named GTZAN_dataset in your project root.
Extract the contents of genres.tar.gz into that folder.

3. Verify Directory Structure

Your project directory should look exactly like this for the code to run correctly:

DLforAudio/
├── GTZAN_dataset/                 
│   └── genres/         
│       ├── blues/            <-- For every genre, there is a folder.
│       │   └── blues.00000.wav 
│       └── ...
└── dataset_downloader.py
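
Rather than checking the tree by eye, a short script can confirm the layout. The sketch below uses a hypothetical helper, count_wavs_per_genre, that is not part of the project. It makes no assumption about genre names and simply counts the .wav files in each subfolder of genres/.

```python
from pathlib import Path


def count_wavs_per_genre(genres_dir):
    """Return {genre_folder_name: number_of_wav_files} for each subfolder."""
    root = Path(genres_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"Expected a folder at {root}")
    return {
        sub.name: sum(1 for _ in sub.glob("*.wav"))
        for sub in sorted(root.iterdir())
        if sub.is_dir()
    }
```

Calling count_wavs_per_genre("GTZAN_dataset/genres") from the project root should return one entry per genre folder. The commonly distributed GTZAN collection has 10 genres with 100 clips each, so an empty result, a zero count, or a FileNotFoundError suggests the archive was extracted to the wrong place.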

4. Path Configuration

Ensure your preprocessing scripts point to the correct folder location:

# preprocess.py (or your config file)
DATASET_PATH = "GTZAN_dataset/genres/"
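
Because DATASET_PATH is a relative path, it only resolves correctly when the script is launched from the project root. One way to make this robust is sketched below. It assumes a hypothetical helper, resolve_dataset_path, that is not part of the project: it anchors the path to a known folder and fails early with a clear message if the dataset is missing.

```python
from pathlib import Path

DATASET_PATH = "GTZAN_dataset/genres/"


def resolve_dataset_path(project_root, dataset_path=DATASET_PATH):
    """Return the absolute dataset folder, failing early if it is missing."""
    resolved = (Path(project_root) / dataset_path).resolve()
    if not resolved.is_dir():
        raise FileNotFoundError(f"Dataset folder not found: {resolved}")
    return resolved
```

If preprocess.py sits in the project root (an assumption), calling resolve_dataset_path(Path(__file__).parent) inside it would give the same folder regardless of the current working directory.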
