AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag audio retrieved: 1539

arampacha/wav2vec2-large-xlsr-czech audio
facebook/s2t-small-mustc-en-ru-st audio
superb/wav2vec2-large-superb-sid audio
espnet/GunnarThor_talromur_a_fastspeech2 audio
Edresson/wav2vec2-large-100k-voxpopuli-ft-Common-Voice_plus_TTS-Dataset-russian audio
JorisCos/ConvTasNet_Libri3Mix_sepclean_16k audio
byan/librispeech_asr_train_asr_conformer_raw_bpe_batch_bins30000000_accum_grad3_optim_conflr0.001_sp audio
espnet/kan-bayashi_csmsc_tacotron2 audio
indonesian-nlp/wav2vec2-large-xlsr-indonesian-baseline audio
indonesian-nlp/wav2vec2-luganda audio
popcornell/FasNetTAC-paper audio
patrickvonplaten/wav2vec2-large-960h-lv60-self-4-gram audio
nvidia/stt_zh_citrinet_1024_gamma_0_25 audio
nvidia/stt_hr_conformer_ctc_large audio
espnet/kan-bayashi_csmsc_tts_train_tacotron2_raw_phn_pypinyin_g2p_phone_train.loss.best audio
jonatasgrosman/wav2vec2-large-english audio
espnet/GunnarThor_talromur_b_fastspeech2 audio
nvidia/stt_en_citrinet_512_ls audio
lgris/base_10k_8khz_pt audio
m3hrdadfi/wav2vec2-large-xlsr-persian-shemo audio
  • «
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • ...
  • 77
  • »
© 2023 BimAnt