AI Model Zoo
Multimodal
Feature Extraction, Text-to-Image, Image-to-Text, Text-to-Video, Visual Question Answering, Document Question Answering, Graph Machine Learning
Computer Vision
Depth Estimation, Image Classification, Object Detection, Image Segmentation, Image-to-Image, Unconditional Image Generation, Video Classification, Zero-Shot Image Classification
Natural Language Processing
Text Classification, Token Classification, Table Question Answering, Question Answering, Zero-Shot Classification, Translation, Summarization, Conversational, Text Generation, Text2Text Generation, Fill-Mask, Sentence Similarity
Audio
Text-to-Speech, Text-to-Audio, Automatic Speech Recognition, Audio-to-Audio, Audio Classification, Voice Activity Detection
Tabular
Tabular Classification, Tabular Regression
Reinforcement Learning
Reinforcement Learning, Robotics

Models retrieved with tag "audio": 1539

mhu-coder/ConvTasNet_Libri1Mix_enhsingle audio
skylord/wav2vec2-large-xlsr-hindi audio
ales/wav2vec2-cv-be audio
nvidia/stt_de_conformer_ctc_large audio
shawn-nyk/wav2vec2-base-960h-with-lm audio
speechcatcher/speechcatcher_german_espnet_streaming_transformer_26k_train_size_xl_raw_de_bpe1024 audio
language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h audio
nvidia/stt_pl_fastconformer_hybrid_large_pc audio
voices/VCTK_British_English_Males audio
lorneluo/faster-whisper-large-v2 audio
Edresson/wav2vec2-large-100k-voxpopuli-ft-Common_Voice_plus_TTS-Dataset_plus_Data_Augmentation-russian audio
Nhut/wav2vec2-large-xlsr-vietnamese audio
espnet/kan-bayashi_jsut_transformer_prosody audio
espnet/kan-bayashi_ljspeech_tts_train_joint_conformer_fastspeech2_hifigan_raw-truncated-af8fe0 audio
espnet/kan-bayashi_vctk_multi_spk_vits audio
groadabike/ConvTasNet_DAMP-VSEP_enhboth audio
groadabike/ConvTasNet_DAMPVSEP_EnglishNonEnglish_baseline audio
julien-c/kan-bayashi_csmsc_tacotron2 audio
othrif/wav2vec2-large-xlsr-egyptian audio
superb/hubert-large-superb-ks audio
© 2023 BimAnt