AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag audio retrieved: 1539

alefiury/wav2vec2-large-xlsr-53-coraa-brazilian-portuguese-gain-normalization audio
espnet/farsi_commonvoice_blstm audio
nvidia/stt_en_fastconformer_hybrid_large_pc audio
nvidia/stt_en_fastconformer_ctc_xlarge audio
nvidia/stt_en_fastconformer_ctc_xxlarge audio
yefengzi/bark-small-fork audio
facebook/s2t-wav2vec2-large-en-ar audio
Clementapa/wav2vec2-base-960h-phoneme-reco-dutch audio
padmalcom/wav2vec2-large-nonverbalvocalization-classification audio
mio/tokiwa_midori audio
TalTechNLP/whisper-large-et audio
Bagus/wav2vec2-xlsr-japanese-speech-emotion-recognition audio
espnet/jiyang_tang_cvss-c_es-en_discrete_unit audio
gchhablani/wav2vec2-large-xlsr-gu audio
jimregan/wav2vec2-large-xlsr-latvian-cv audio
alefiury/wav2vec2-xls-r-300m-pt-br-spontaneous-speech-emotion-recognition audio
espnet/Wangyou_Zhang_chime4_enh_train_enh_beamformer_mvdr_raw audio
facebook/xm_transformer_600m-en_ar-multi_domain audio
facebook/xm_transformer_600m-es_en-multi_domain audio
IIC/wav2vec2-spanish-multilibrispeech audio
  • «
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • ...
  • 77
  • »
© 2023 BimAnt