espnet audio speech-recognition

ESPnet2 ASR model

espnet/wanchichen_fleurs_asr_conformer_sctctc

This model was trained by William Chen using the FLEURS recipe in ESPnet.

Demo: How to use in ESPnet2

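# Install ESPnet in editable mode from a local checkout of the repository
cd espnet
pip install -e .
# Enter the FLEURS ASR recipe directory and launch the recipe
cd egs2/fleurs/asr1
./run.sh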

<!-- Generated by scripts/utils/show_asr_result.sh -->

RESULTS

Environments

asr_train_asr_xlsr_conformer_scctc_raw_all_bpe6500_sp

WER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|decode_asr_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave_3best/test_all|77809|1592160|70.5|26.1|3.4|3.4|32.9|97.0|
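
For readers unfamiliar with the columns, the sketch below shows one way the Corr, Sub, Del, and Ins counts behind a word error rate can be obtained from a minimum-edit alignment of a reference and a hypothesis. It is an illustration only: the function names `align_counts` and `error_rate` are hypothetical, the tie-breaking between equally cheap alignments is an arbitrary choice, and it is not the scoring tool that produced the numbers in this card.

```python
def align_counts(ref, hyp):
    """Return (corr, sub, dele, ins) for one minimum-edit alignment of two token lists."""
    # table[i][j] holds (errors, corr, sub, dele, ins) for ref[:i] versus hyp[:j]
    table = [[None] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    table[0][0] = (0, 0, 0, 0, 0)
    for i in range(1, len(ref) + 1):
        table[i][0] = (i, 0, 0, i, 0)  # every reference token deleted
    for j in range(1, len(hyp) + 1):
        table[0][j] = (j, 0, 0, 0, j)  # every hypothesis token inserted
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            e, c, s, d, n = table[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diag = (e, c + 1, s, d, n)      # match
            else:
                diag = (e + 1, c, s + 1, d, n)  # substitution
            e, c, s, d, n = table[i - 1][j]
            delete = (e + 1, c, s, d + 1, n)    # reference token missing from hypothesis
            e, c, s, d, n = table[i][j - 1]
            insert = (e + 1, c, s, d, n + 1)    # extra hypothesis token
            # Keep the candidate with the fewest errors (ties: match/sub, then del, then ins)
            table[i][j] = min(diag, delete, insert, key=lambda t: t[0])
    return table[-1][-1][1:]


def error_rate(ref, hyp):
    """Error rate in percent: (Sub + Del + Ins) relative to the reference length."""
    _, sub, dele, ins = align_counts(ref, hyp)
    return 100.0 * (sub + dele + ins) / len(ref)


ref = "the cat sat down".split()
hyp = "the dog sat".split()
print(align_counts(ref, hyp))  # (corr, sub, dele, ins)
print(error_rate(ref, hyp))
```

Passing lists of characters or subword tokens instead of words gives the character-level or token-level counterpart of the same measure.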

CER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|decode_asr_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave_3best/test_all|77809|10235271|92.2|4.7|3.1|2.6|10.4|97.0|

TER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|decode_asr_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave_3best/test_all|77809|9622352|91.3|5.6|3.1|2.7|11.4|97.0|
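
As a reading aid for the three tables, the short sketch below checks how the percentage columns relate to one another: in score reports of this style, Err is the sum of Sub, Del, and Ins, while Corr, Sub, and Del together account for all reference units. The helper name `check_row` and the rounding tolerance are assumptions made for this illustration; the numbers are copied from the test_all rows above.

```python
# (Corr, Sub, Del, Ins, Err) percentages copied from the test_all rows
rows = {
    "WER": (70.5, 26.1, 3.4, 3.4, 32.9),
    "CER": (92.2, 4.7, 3.1, 2.6, 10.4),
    "TER": (91.3, 5.6, 3.1, 2.7, 11.4),
}


def check_row(corr, sub, dele, ins, err, tol=0.2):
    """True if Err = Sub + Del + Ins and Corr + Sub + Del = 100, within rounding."""
    err_ok = abs(sub + dele + ins - err) <= tol
    total_ok = abs(corr + sub + dele - 100.0) <= tol
    return err_ok and total_ok


for name, row in rows.items():
    print(name, check_row(*row))
```

S.Err is reported per sentence rather than per unit: it is the share of the 77809 test sentences that contain at least one error, which is why it is identical (97.0) across the three tables.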