audio automatic-speech-recognition speech

Wav2Vec2 Accent Japanese

Fine-tuned facebook/wav2vec2-large-xlsr-53 on Japanese accent dataset When using this model, make sure that your speech input is sampled at 16kHz.

Test Result

WER: 15.82%

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

Unreal engine based photo realistic synthetic data generator for YOLO.

AI powered 3d texture generation and projection SDK for three.js.