AI Model Zoo

Models retrieved with tag "audio": 1539

facebook/textless_sm_sl_en audio
nvidia/stt_it_conformer_ctc_large audio
Teerawach12/wav2vec2-test-for-finalproject audio
carlosdanielhernandezmena/stt_fo_quartznet15x5_sp_ep163_100h audio
vasista22/wav2vec2-360h-base-ft-100h audio
nvidia/stt_eo_conformer_transducer_large audio
projecte-aina/stt-ca-citrinet-512 audio
ypluit/stt_kr_citrinet1024_PublicCallCenter_1000H audio
ypluit/stt_kr_citrinet1024_PublicCallCenter_1000H_0.26 audio
espnet/kmiyazaki_librispeech_asr_s4_decoder audio
ypluit/stt_kr_citrinet1024_PublicCallCenter_1000H_0.22 audio
Kamtera/persian-tts-female-vits audio
carlosdanielhernandezmena/whisper-small-faroese-5k-steps-100h audio
arijitx/whisper-small-bn audio
Sangramsing/whisper-small audio
Sangramsing/whisper-medium audio
flozi00/whisper-large-german-lora-cv13 audio
chunpingvi/wav2vec2-base-vietnamese audio
nikhilanvekar2001/Hindi_asr_with_LM audio
tjysdsg/11692_cyclic_asr_tts_gumbel_softmax_init audio
© 2023 BimAnt