AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag audio retrieved: 1539

espnet/kan-bayashi_libritts_gst_xvector_conformer_fastspeech2 audio
espnet/kan-bayashi_libritts_gst_xvector_trasnformer audio
espnet/kan-bayashi_libritts_tts_train_gst_xvector_conformer_fastspeech2_trans-truncated-c3209b audio
espnet/kan-bayashi_ljspeech_tts_train_vits_raw_phn_tacotron_g2p_en_no_space_train.total_count.ave audio
espnet/kan-bayashi_vctk_gst_transformer audio
facebook/s2t-small-covost2-fr-en-st audio
facebook/wav2vec2-base-fr-voxpopuli audio
facebook/wav2vec2-large-fr-voxpopuli audio
joaoalvarenga/wav2vec2-large-xlsr-portuguese audio
manandey/wav2vec2-large-xlsr-punjabi audio
mohammed/wav2vec2-large-xlsr-arabic audio
tugstugi/wav2vec2-large-xlsr-53-kalmyk audio
voidful/wav2vec2-large-xlsr-53-hk audio
pyf98/librispeech_conformer_layerdrop0.1_last6 audio
espnet/marathi_openslr64 audio
facebook/wav2vec2-conformer-rope-large-100h-ft audio
espnet/english_male_ryanspeech_conformer_fastspeech2 audio
joaogante/test_audio audio
SYSPIN/Telugu_Male_TTS audio
nvidia/stt_kab_conformer_transducer_large audio
  • «
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • ...
  • 77
  • »
© 2023 BimAnt