Add Dia TTS fine-tuning tutorial by Deep-unlearning · Pull Request #232 · huggingface/audio-transformers-course

Deep-unlearning · 2025-12-17T21:32:47Z

This adds a comprehensive tutorial for fine-tuning Dia (Nari Labs' conversational TTS model) for text-to-speech synthesis on a new language. The tutorial covers:

Understanding Dia's multi-speaker dialogue format with [S1], [S2] tags
Loading and preparing conversational speech datasets
Creating a custom DiaDataCollator for proper audio/text formatting
Memory-efficient training with 8-bit optimization
Single-speaker and multi-speaker dialogue generation
Adapting to different languages (French, German examples)
Evaluation strategies for TTS models

This adds a comprehensive tutorial for fine-tuning Dia (Nari Labs' conversational TTS model) for text-to-speech synthesis on a new language. The tutorial covers: - Understanding Dia's multi-speaker dialogue format with [S1], [S2] tags - Loading and preparing conversational speech datasets - Creating a custom DiaDataCollator for proper audio/text formatting - Memory-efficient training with 8-bit optimization - Single-speaker and multi-speaker dialogue generation - Adapting to different languages (French, German examples) - Evaluation strategies for TTS models

Deep-unlearning added 2 commits December 17, 2025 21:16

nit

3c9c707

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Dia TTS fine-tuning tutorial#232

Add Dia TTS fine-tuning tutorial#232
Deep-unlearning wants to merge 2 commits intomainfrom
finetune-dia

Deep-unlearning commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Deep-unlearning commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant