DTF-VITS

An experimental variation of VITS with Microsoft's Differential Transformer method applied on its text encoder.

Models

normal is the original VITS model with its default setting of 2 transformer heads for comparison. #

dtf is a modified VITS model with 1 differential transformer head. #

dtf_v2 is a modified VITS model with 2 differential transformer heads. #

Each model was trained with LJ Speech dataset for 20000 steps.

Output demonstrations

democomp.mp4

Demonstration of all 50 test samples by each model.

demo2.mp4

Comparison of normal and dtf.

melcompv.mp4

Comparison for the text "These principles of homology are essential to a correct interpretation of the facts of morphology." (LJ027-0052.wav from validation dataset defined in the original VITS repo). Demonstrated spectrograms can be found here.

Test results using NISQA

For each sentence in test script, 10 wav files were generated and tested through NISQA(v2.0) model.

Model	MOS	Noisiness	Discontinuity	Coloration	Loudness
`normal`	4.32 ± 0.37	3.87 ± 0.41	4.53 ± 0.28	4.31 ± 0.22	4.51 ± 0.18
`dtf`	4.24 ± 0.37	3.79 ± 0.42	4.51 ± 0.30	4.28 ± 0.24	4.49 ± 0.20
`dtf_v2`	4.24 ± 0.37	3.86 ± 0.44	4.53 ± 0.27	4.26 ± 0.23	4.47 ± 0.20

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
configs		configs
filelists		filelists
monotonic_align		monotonic_align
resources		resources
text		text
LICENSE		LICENSE
README.md		README.md
S_monotonic_align.py		S_monotonic_align.py
S_monotonic_align_Triton.py		S_monotonic_align_Triton.py
attentions.py		attentions.py
attentions2.py		attentions2.py
attentions3.py		attentions3.py
commons.py		commons.py
data_utils.py		data_utils.py
inference.ipynb		inference.ipynb
inference.py		inference.py
losses.py		losses.py
mel_processing.py		mel_processing.py
models.py		models.py
models_normal.py		models_normal.py
models_v1.py		models_v1.py
modules.py		modules.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
testscript.txt		testscript.txt
train.py		train.py
train_ms.py		train_ms.py
transforms.py		transforms.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DTF-VITS

Models

Output demonstrations

Test results using NISQA

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DTF-VITS

Models

Output demonstrations

Test results using NISQA

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages