espnet audio speech-recognition

ESPnet2 ASR model

espnet/wanchichen_fleurs_english_asr_wav2vec_frontend

This model was trained by William Chen using the fleurs recipe in espnet.

Demo: How to use in ESPnet2

cd espnet
pip install -e .
cd egs2/fleurs/asr1
./run.sh

<!-- Generated by scripts/utils/show_asr_result.sh -->

RESULTS

Environments

asr_train_asr_wav2vec_960h_transformer_raw_en_us_bpe300_sp

WER

dataset Snt Wrd Corr Sub Del Ins Err S.Err
decode_asr_asr_model_valid.acc.best/test_all 647 14344 67.1 29.4 3.5 4.6 37.5 99.8
decode_asr_asr_model_valid.acc.best/dev_all 388 7935 66.8 29.7 3.6 5.0 38.2 99.0

CER

dataset Snt Wrd Corr Sub Del Ins Err S.Err
decode_asr_asr_model_valid.acc.best/test_all 647 83954 88.6 5.4 6.0 4.8 16.2 99.8
decode_asr_asr_model_valid.acc.best/dev_all 388 47051 88.1 6.0 5.9 4.4 16.3 99.0

TER

dataset Snt Wrd Corr Sub Del Ins Err S.Err
decode_asr_asr_model_valid.acc.best/test_all 647 39965 7.7 14.9 7.4 4.1 26.4 99.8
decode_asr_asr_model_valid.acc.best/dev_all 388 22491 77.3 15.2 7.5 3.8 26.5 99.0