AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag audio retrieved: 1539

facebook/textless_sm_pt_fr audio
carlosdanielhernandezmena/stt_mt_quartznet15x5_sp_ep255_64h audio
Jzuluaga/wav2vec2-large-960h-lv60-self-en-atc-atcosim audio
PaulChimzy/stt_rw_conformer_CTC_large audio
projecte-aina/tts-ca-coqui-vits-multispeaker audio
pyf98/chime4_e_branchformer_e10 audio
pyf98/chime4_conformer_e12_linear2048 audio
pyf98/librispeech_100_ctc_e_branchformer audio
asapp/e_branchformer_librispeech audio
ulysses115/pmvoice audio
soumi-maiti/libri2mix_eend_ss audio
Sangramsing/whisper-tiny audio
Sangramsing/whisper-base audio
AQuarterMile/opencpop_visinger1 audio
legekka/diana-hungarian-tts-vits audio
padmalcom/wav2vec2-asr-ultimate-german audio
nvidia/stt_hr_fastconformer_hybrid_large_pc audio
nvidia/stt_be_fastconformer_hybrid_large_pc audio
fujie/fujie_jvs_ms_tts_finetune_xvector_vits_raw_phn_jaconv_pyopenjtalk_prosody audio
arc-r/faster-whisper-large-v2-jp audio
  • «
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • ...
  • 77
  • »
© 2023 BimAnt