AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag audio retrieved: 1539

bond005/wav2vec2-large-ru-golos audio
NbAiLab/nb-whisper-base-beta audio
facebook/s2t-small-mustc-en-de-st audio
facebook/tts_transformer-ar-cv7 audio
Temur/wav2vec2-Georgian-Daytona audio
julien-c/DPRNNTasNet-ks16_WHAM_sepclean audio
openpecha/speecht5-tts-01 audio
elgeish/wav2vec2-base-timit-asr audio
othrif/wav2vec2-large-xlsr-moroccan audio
nguyenvulebinh/wav2vec2-base-vi-vlsp2020 audio
language-and-voice-lab/whisper-large-icelandic-62640-steps-967h audio
m3hrdadfi/wav2vec2-xlsr-greek-speech-emotion-recognition audio
Gatozu35/tortoise-tts audio
facebook/wav2vec2-large-100k-voxpopuli audio
facebook/wav2vec2-large-xlsr-53-dutch audio
NbAiLab/nb-whisper-tiny-beta audio
facebook/s2t-large-librispeech-asr audio
facebook/xm_transformer_s2ut_en-hk audio
nvidia/stt_en_fastconformer_transducer_xxlarge audio
espnet/fastspeech2_conformer audio
  • «
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • ...
  • 77
  • »
© 2023 BimAnt