AI Model Zoo
  • Home
  • 3D Convert
  • GLTF Editor
  • DreamTexture.js
  • UnrealSynth
  • NSDT Studio
  • BimAnt
Multimodal
Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning
Computer Vision
Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification
Natural Language Processing
Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity
Audio
Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection
Tabular
Tabular Classification Tabular Regression
Reinforcement Learning
Reinforcement Learning Robotics

Models with tag multimodal retrieved: 13

HuggingFaceM4/idefics-9b multimodal
HuggingFaceM4/idefics-9b-instruct multimodal
HuggingFaceM4/idefics-80b-instruct multimodal
HuggingFaceM4/idefics-80b multimodal
sujitpal/clip-imageclef multimodal
MonoHime/meld-emo-intermodal multimodal
sshh12/Mistral-7B-LoRA-VisionCLIP-LLAVA multimodal
MonoHime/mosei-emo-intermodal multimodal
MonoHime/iemocap-emo-intermodal multimodal
waybarrios/guidance-based-video-grounding multimodal
MonoHime/mosei-senti-intermodal multimodal
MonoHime/mosi-senti-intermodal multimodal
typeof/idefics-9b multimodal
  • «
  • 1
  • »
© 2023 BimAnt