text-to-speech

speecht5_tts-ft-voxpopuli-it

This model is a fine-tuned version of microsoft/speecht5_tts on the facebook/voxpopuli dataset. It achieves the following results on the evaluation set:

Model description

It uses the speaker embedding model speechbrain/spkrec-xvect-voxceleb

Intended uses & limitations

More information needed

Training and evaluation data

test_size=0.15

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

Training Loss Epoch Step Validation Loss
0.6118 1.94 300 0.5508
0.5729 3.89 600 0.5204
0.563 5.83 900 0.5126

Framework versions