whisper-large-v2-et-children

This model is a fine-tuned version of agnesluhtaru/whisper-large-et-ERR2020-v2 on an Estonian children's speech dataset.
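A minimal usage sketch, assuming the checkpoint is published on the Hugging Face Hub and loads through the standard `transformers` speech-recognition pipeline. The repository id in `MODEL_ID` is an assumption pieced together from the base model's namespace and this card's title, and `transcribe` and the audio path are hypothetical names used only for illustration.

```python
# ASSUMPTION: this repository id is not stated in the card; it is guessed from
# the base model's namespace and the card title. Replace it with the real id.
MODEL_ID = "agnesluhtaru/whisper-large-v2-et-children"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file with the fine-tuned Whisper checkpoint."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # split long recordings into 30-second windows
    )
    result = asr(audio_path)
    return result["text"]
```

Calling `transcribe("recording.wav")` (a hypothetical file) downloads the model weights on first use, so a GPU and several gigabytes of disk space are advisable for a model of this size.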

The model's performance and the data used for training and evaluation are described in:

Luhtaru, Agnes; Jaaska, Rauno; Kruusamäe, Karl; Fishel, Mark (2023). Automatic Transcription for Estonian Children’s Speech. In: Proceedings of the 24th Nordic Conference on Computational Linguistics. https://openreview.net/forum?id=xbPTfBIUby

Training hyperparameters

The following hyperparameters were used during training:

Training results

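| Training Loss | Epoch | Step | Validation Loss | WER     |
|---------------|-------|------|-----------------|---------|
| 0.0302        | 4.03  | 500  | 0.2971          | 16.2892 |
| 0.0042        | 8.06  | 1000 | 0.3406          | 15.8551 |
| 0.0017        | 12.1  | 1500 | 0.3714          | 15.5585 |
| 0.0009        | 16.13 | 2000 | 0.3934          | 15.6445 |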
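
To make the WER column easier to interpret, here is a small sketch of how word error rate is conventionally computed and how the rows above can be compared. It assumes the reported values are percentages (word-level edit distance divided by the number of reference words, times 100); the card does not say which WER implementation the Trainer used. The function name, the `RESULTS` list, and the example sentences in the usage note are illustrative, not part of the original training code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER in percent: word-level edit distance / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed ref prefix and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,                         # deletion
                curr[j - 1] + 1,                     # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution or match
            ))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# (step, validation loss, WER) copied from the training results above
RESULTS = [
    (500, 0.2971, 16.2892),
    (1000, 0.3406, 15.8551),
    (1500, 0.3714, 15.5585),
    (2000, 0.3934, 15.6445),
]

# Pick the checkpoint that minimizes each criterion.
best_by_wer = min(RESULTS, key=lambda row: row[2])
best_by_loss = min(RESULTS, key=lambda row: row[1])
```

For example, `word_error_rate("a b c d", "a x c")` counts one substitution and one deletion against four reference words. In these rows the lowest validation loss (step 500) and the lowest WER (step 1500) fall on different checkpoints, so the choice of selection criterion matters.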

Framework versions