AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics

Models with tag audio retrieved: 1539

espnet/kan-bayashi_ljspeech_joint_train_conformer_fastspeech2_hifigan audio
espnet/kan-bayashi_ljspeech_tts_train_fastspeech2_raw_phn_tacotron_g2p_en_no_space_train.loss.ave audio
espnet/kan-bayashi_ljspeech_tts_train_fastspeech_raw_phn_tacotron_g2p_en_no_space_train.loss.best audio
espnet/kan-bayashi_vctk_gst_fastspeech2 audio
espnet/kan-bayashi_vctk_gst_xvector_conformer_fastspeech2 audio
espnet/kan-bayashi_vctk_gst_xvector_transformer audio
espnet/kan-bayashi_vctk_tts_train_full_band_multi_spk_vits_raw_phn_tacotron_g-truncated-50b003 audio
espnet/kan-bayashi_vctk_tts_train_xvector_tacotron2_raw_phn_tacotron_g2p_en_no_space_train.loss.ave audio
espnet/shinji-watanabe-librispeech_asr_train_asr_transformer_e18_raw_bpe_sp_valid.acc.best audio
espnet/simpleoier_librispeech_asr_train_asr_conformer7_hubert_ll60k_large_raw_en_bpe5000_sp audio
facebook/wav2vec2-base-10k-voxpopuli-ft-fi audio
facebook/wav2vec2-base-10k-voxpopuli-ft-fr audio
facebook/wav2vec2-base-de-voxpopuli-v2 audio
facebook/wav2vec2-base-es-voxpopuli audio
facebook/wav2vec2-base-it-voxpopuli-v2 audio
gchhablani/wav2vec2-large-xlsr-hu audio
gorkemgoknar/wav2vec2-large-xlsr-53-turkish audio
jimregan/wav2vec2-large-xlsr-irish-basic audio
joaoalvarenga/wav2vec2-large-xlsr-portuguese-a audio
kmfoda/wav2vec2-large-xlsr-arabic audio
  • «
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • ...
  • 77
  • »
© 2023 BimAnt