AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag audio retrieved: 1539

paris-iea/speaker-diarization audio
Bazzar/bark-small audio
facebook/wav2vec2-base-10k-voxpopuli-ft-en audio
mpariente/ConvTasNet_Libri1Mix_enhsingle_8k audio
SLPL/Sharif-wav2vec2 audio
patrickvonplaten/wav2vec2-conformer-rel-pos-large-960h-ft-4-gram audio
teticio/audio-diffusion-ddim-256 audio
KIFF/pyannote-speaker-diarization-endpoint audio
bofenghuang/stt_fr_fastconformer_hybrid_large audio
anton-l/wav2vec2-large-xlsr-53-ukrainian audio
facebook/s2t-small-covost2-en-fa-st audio
lichenda/wsj0_2mix_skim_noncausal audio
Pranjal12345/pranjal_whisper_medium audio
facebook/wav2vec2-large-uralic-voxpopuli-v2 audio
lgris/bp400-xlsr audio
m3hrdadfi/hubert-base-greek-speech-emotion-recognition audio
imdanboy/jets audio
facebook/s2t-small-covost2-de-en-st audio
mohammed/ar audio
espnet/english_male_ryanspeech_fastspeech2 audio
  • «
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • ...
  • 77
  • »
© 2023 BimAnt