AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics
...
NSDT 3DConvert

Convert 30+ 3D formats online: GLTF, GLB, GBX, OBJ, DAE, IFC, STEP, STL...

...
UnrealSynth

Unreal engine based photo realistic synthetic data generator for YOLO.

...
DreamTexture.js

AI powered 3d texture generation and projection SDK for three.js.

Models with tag speech retrieved: 692

jonatasgrosman/wav2vec2-large-xlsr-53-english speech
pyannote/segmentation speech
pyannote/speaker-diarization speech
jonatasgrosman/wav2vec2-large-xlsr-53-russian speech
facebook/wav2vec2-xlsr-53-espeak-cv-ft speech
jonatasgrosman/wav2vec2-large-xlsr-53-portuguese speech
nvidia/speakerverification_en_titanet_large speech
pyannote/speaker-diarization-3.0 speech
pyannote/segmentation-3.0 speech
jonatasgrosman/wav2vec2-large-xlsr-53-arabic speech
facebook/wav2vec2-large-robust-ft-swbd-300h speech
microsoft/wavlm-large speech
facebook/wav2vec2-base speech
pyannote/embedding speech
facebook/wav2vec2-large-xlsr-53 speech
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim speech
facebook/hubert-large-ls960-ft speech
jonatasgrosman/wav2vec2-large-xlsr-53-chinese-zh-cn speech
pyannote/voice-activity-detection speech
facebook/hubert-base-ls960 speech
  • «
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • ...
  • 35
  • »
© 2023 BimAnt