Fhrozen/tts_prodiff_jp_multispk - AI Model Zoo

espnet audio text-to-speech

TTS model (Japanese) - ProDiff with GST + X-Vector

No support given.

num_iters_per_epoch: 250
max_epoch: 600
batch_bins: 6000000
tts_conf: 
    spk_embed_dim: 192

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

Unreal engine based photo realistic synthetic data generator for YOLO.

AI powered 3d texture generation and projection SDK for three.js.