automatic-speech-recognition generated_from_trainer false nb-NO robust-speech-event model_for_talk hf-asr-leaderboard

XLS-R-300M-LM - Norwegian

This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the Norwegian NPSC dataset.

Scores without Language Model

Without using a language model, it achieves the following scores on the NPSC Eval set It achieves the following results on the evaluation set without a language model:

Scores with Language Model

A 5-gram KenLM was added to boost the models performance. The language model was created on a corpus mainly consisting of online newspapers, public reports and Wikipedia data. After this we are getting these values.

Team

The model is developed by Rolv-Arild Braaten, Per Egil Kummervold, Andre Kåsen, Javier de la Rosa, Per Erik Solberg, and Freddy Wetjen. Name in alphabetic order.

Model description

This current version is based on checkpoint 8500 of NbAiLab/wav2vec2-xlsr-300M-NPSC-OH.

Intended uses & limitations

Demo version only. The model will be updated later this week.

Training and evaluation data

The model is trained and evaluated on NPSC. Unfortunately there is no Norwegian test data in Common Voice, and currently the model is only evaluated on the validation set of NPSC..

Training procedure

Training hyperparameters

The following hyperparameters were used during training: