
ESPnet2 ASR model

espnet/wanchichen_fleurs_asr_conformer_hier_lid_utt

This model was trained by William Chen using the fleurs recipe in ESPnet.

Demo: How to use in ESPnet2

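# From a local checkout of the ESPnet repository, install ESPnet in editable mode
cd espnet
pip install -e .
# Move to the fleurs ASR recipe and run it
cd egs2/fleurs/asr1
./run.sh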
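
For inference without running the whole recipe, the sketch below loads the checkpoint by its model tag through ESPnet2's Python interface. It is a minimal sketch, not a tested pipeline: it assumes that espnet, espnet_model_zoo, and soundfile are installed, that the tag can be downloaded from the Hugging Face Hub, and that the input is a 16 kHz mono recording. The transcribe helper and its parameters are hypothetical names introduced here for illustration.

```python
MODEL_TAG = "espnet/wanchichen_fleurs_asr_conformer_hier_lid_utt"
EXPECTED_RATE = 16000  # assumption: the model expects 16 kHz input


def transcribe(wav_path, model_tag=MODEL_TAG, device="cpu"):
    """Return the best hypothesis text for one audio file (illustrative helper)."""
    # Imported lazily so this file can be loaded without ESPnet installed.
    import soundfile as sf
    from espnet2.bin.asr_inference import Speech2Text

    # from_pretrained fetches and caches the packed model via espnet_model_zoo.
    speech2text = Speech2Text.from_pretrained(model_tag, device=device)

    speech, rate = sf.read(wav_path)
    if rate != EXPECTED_RATE:
        raise ValueError(f"expected {EXPECTED_RATE} Hz audio, got {rate} Hz")
    if speech.ndim != 1:
        raise ValueError("expected a mono recording")

    # The call returns an n-best list; each entry holds
    # (text, tokens, token ids, hypothesis object).
    nbest = speech2text(speech)
    text, _tokens, _token_ids, _hypothesis = nbest[0]
    return text
```

Calling transcribe("sample.wav"), where sample.wav is a placeholder path, would return the top hypothesis as a string. Decoding options such as beam size or LM weight are left at the defaults stored with the model.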

<!-- Generated by scripts/utils/show_asr_result.sh -->

RESULTS

Environments

asr_train_asr_conformer_lid_utt_scctc_raw_all_bpe6500_train_data_path_and_name_and_typedumprawtrain_all_splid,lid,text_sp

WER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|decode_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave/dev_all|31622|610500|72.9|24.4|2.7|3.1|30.2|95.5|
|decode_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave/test_all|77809|1592160|72.2|25.0|2.9|3.6|31.5|96.6|

CER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|decode_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave/dev_all|31622|3988181|92.6|4.7|2.6|2.2|9.6|95.5|
|decode_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave/test_all|77809|10235271|92.5|4.7|2.8|2.6|10.1|96.7|

TER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|decode_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave/dev_all|31622|3547834|91.4|5.8|2.8|2.5|11.0|95.4|
|decode_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave/test_all|77809|9622352|91.6|5.6|2.8|2.8|11.2|96.6|
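
To read the tables above: in sclite-style scoring, Err is the sum of the substitution, deletion, and insertion rates, and Corr, Sub, and Del together account for all reference tokens. The sketch below recomputes Err from the reported columns as a sanity check. The values are copied from the tables; the function names and the 0.15 tolerance are illustrative choices. Because each column is rounded independently, the recomputed value can differ from the reported one by about 0.1.

```python
# Percentages copied from the tables above: (Corr, Sub, Del, Ins, Err).
REPORTED = {
    ("WER", "dev_all"): (72.9, 24.4, 2.7, 3.1, 30.2),
    ("WER", "test_all"): (72.2, 25.0, 2.9, 3.6, 31.5),
    ("CER", "dev_all"): (92.6, 4.7, 2.6, 2.2, 9.6),
    ("CER", "test_all"): (92.5, 4.7, 2.8, 2.6, 10.1),
    ("TER", "dev_all"): (91.4, 5.8, 2.8, 2.5, 11.0),
    ("TER", "test_all"): (91.6, 5.6, 2.8, 2.8, 11.2),
}


def recompute_err(sub, dele, ins):
    """Error rate as the sum of substitution, deletion, and insertion rates."""
    return round(sub + dele + ins, 1)


def check_rows(rows=REPORTED, tolerance=0.15):
    """Map each (metric, split) to (reported Err, recomputed Err, consistent?)."""
    checked = {}
    for key, (_corr, sub, dele, ins, err) in rows.items():
        recomputed = recompute_err(sub, dele, ins)
        checked[key] = (err, recomputed, abs(recomputed - err) <= tolerance)
    return checked


for (metric, split), (err, recomputed, ok) in check_rows().items():
    status = "consistent" if ok else "mismatch"
    print(f"{metric} {split}: reported {err}, recomputed {recomputed} ({status})")
```

The same check can be applied to any row produced by scripts/utils/show_asr_result.sh by adding it to the dictionary.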