automatic-speech-recognition k2

Icefall streaming ASR model for Estonian

This is a streaming end-to-end transducer model for Estonian, trained using Icefall

It is trained on around 800 h of manually transcribed speech from various domains and on about 2500 h of automatically transcribed speech from Estonian TV (mainly news and talkshows)

Serving

To use it on a server for browser-based ASR: