audio automatic-speech-recognition speech

Wav2Vec2 Accent Japanese

Fine-tuned facebook/wav2vec2-large-xlsr-53 on Japanese accent dataset When using this model, make sure that your speech input is sampled at 16kHz.

Test Result

WER: 15.82%