espnet audio automatic-speech-recognition

RESULTS

Environments

asr_train_asr_raw_word

WER

dataset Snt Wrd Corr Sub Del Ins Err S.Err
inference_asr_model_valid.acc.ave_5best/test 49 4134 64.6 23.5 11.8 16.4 51.8 98.0
inference_asr_model_valid.acc.ave_5best/valid 47 4178 66.8 20.2 13.0 19.2 52.5 100.0

CER

dataset Snt Wrd Corr Sub Del Ins Err S.Err
inference_asr_model_valid.acc.ave_5best/test 49 8690 73.2 18.0 8.8 12.9 39.7 98.0
inference_asr_model_valid.acc.ave_5best/valid 47 8751 74.3 15.7 10.0 15.6 41.3 100.0

TER

dataset Snt Wrd Corr Sub Del Ins Err S.Err