AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag speech retrieved: 692

mrm8488/wav2vec2-large-xlsr-53-ukrainian speech
mrshu/wav2vec2-large-xlsr-slovene speech
tugstugi/wav2vec2-large-xlsr-53-mongolian speech
vumichien/wav2vec2-large-pitch-recognition speech
cwkeam/mctct-large speech
nvidia/stt_en_citrinet_768_ls speech
Edresson/wav2vec2-large-100k-voxpopuli-ft-TTS-Dataset-russian speech
nvidia/stt_be_conformer_transducer_large speech
nvidia/stt_it_conformer_ctc_large speech
Teerawach12/wav2vec2-test-for-finalproject speech
carlosdanielhernandezmena/stt_fo_quartznet15x5_sp_ep163_100h speech
nvidia/stt_eo_conformer_transducer_large speech
projecte-aina/stt-ca-citrinet-512 speech
ypluit/stt_kr_citrinet1024_PublicCallCenter_1000H speech
ypluit/stt_kr_citrinet1024_PublicCallCenter_1000H_0.26 speech
ypluit/stt_kr_citrinet1024_PublicCallCenter_1000H_0.22 speech
inOXcrm/German_multispeaker_FastPitch_nemo speech
flozi00/whisper-large-german-lora-cv13 speech
chunpingvi/wav2vec2-base-vietnamese speech
nikhilanvekar2001/Hindi_asr_with_LM speech
  • «
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • ...
  • 35
  • »
© 2023 BimAnt