AI Model Zoo
Home
3D Convert
GLTF Editor
DreamTexture.js
UnrealSynth
NSDT Studio
BimAnt
Multimodal
Feature Extraction
Text-to-Image
Image-to-Text
Text-to-Video
Visual Question Answering
Document Question Answering
Graph Machine Learning
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Image-to-Image
Unconditional Image Generation
Video Classification
Zero-Shot Image Classification
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Conversational
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Reinforcement Learning
Reinforcement Learning
Robotics
Models with tag
visual-question-answering
retrieved: 119
tifa-benchmark/promptcap-coco-vqa
visual-question-answering
microsoft/git-base-vqav2
visual-question-answering
microsoft/git-large-vqav2
visual-question-answering
google/matcha-chartqa
visual-question-answering
google/matcha-base
visual-question-answering
google/pix2struct-widget-captioning-large
visual-question-answering
google/pix2struct-infographics-vqa-large
visual-question-answering
NhatDFO/sf_blip2
visual-question-answering
Gregor/mblip-mt0-xl
visual-question-answering
kpyu/video-blip-opt-2.7b-ego4d
visual-question-answering
google/pix2struct-screen2words-large
visual-question-answering
google/pix2struct-ai2d-large
visual-question-answering
paragon-AI/blip2-image-to-text
visual-question-answering
kpyu/video-blip-flan-t5-xl-ego4d
visual-question-answering
google/pix2struct-widget-captioning-base
visual-question-answering
google/matcha-chart2text-pew
visual-question-answering
google/pix2struct-screen2words-base
visual-question-answering
ivelin/donut-refexp-combined-v1
visual-question-answering
google/matcha-plotqa-v1
visual-question-answering
microsoft/git-large-textvqa
visual-question-answering
«
1
2
3
4
5
6
»