AI Model Zoo

Models tagged "speech": 692 retrieved (20 listed below)

nvidia/stt_en_conformer_transducer_xlarge speech
nvidia/tts_hifigan speech
boris/xlsr-en-punctuation speech
facebook/wav2vec2-conformer-rope-large speech
superb/hubert-base-superb-er speech
facebook/wav2vec2-large-xlsr-53-german speech
nvidia/tts_en_fastpitch speech
indonesian-nlp/wav2vec2-large-xlsr-indonesian speech
KBLab/wav2vec2-large-voxrex-swedish speech
pyannote/brouhaha speech
microsoft/wavlm-base-sv speech
asapp/sew-tiny-100k-ft-ls100h speech
facebook/wav2vec2-conformer-rel-pos-large-960h-ft speech
elgeish/wav2vec2-large-xlsr-53-arabic speech
nvidia/stt_ru_conformer_transducer_large speech
asapp/sew-d-tiny-100k-ft-ls100h speech
nvidia/stt_en_fastconformer_ctc_large speech
ydshieh/wav2vec2-large-xlsr-53-chinese-zh-cn-gpt speech
viktor-enzell/wav2vec2-large-voxrex-swedish-4gram speech
microsoft/unispeech-sat-large speech
© 2023 BimAnt