ksponspeech

stt_kr_conformer_ctc_medium

Fine-tuned from "stt_en_conformer_ctc_medium" https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_en_conformer_ctc_medium
Trained on KsponSpeech, provided by https://aihub.or.kr/

Preprocessing

Files converted from .pcm -> .wav
Text - Korean phonetic transcription
SentencePiece tokenizer (Byte-pair encoding), vocab-size = 5,000

Evaluation

"KsponSpeech_eval_clean", "KsponSpeech_eval_other"

NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.