AI Model Zoo

Models with tag image-to-text retrieved: 311

nlpconnect/vit-gpt2-image-captioning image-to-text
Salesforce/blip-image-captioning-large image-to-text
Salesforce/blip-image-captioning-base image-to-text
microsoft/trocr-large-printed image-to-text
microsoft/trocr-base-handwritten image-to-text
microsoft/trocr-large-stage1 image-to-text
microsoft/trocr-base-printed image-to-text
Salesforce/blip2-opt-2.7b image-to-text
google/pix2struct-textcaps-base image-to-text
fxmarty/pix2struct-tiny-random image-to-text
microsoft/git-base image-to-text
Salesforce/blip2-flan-t5-xl image-to-text
Salesforce/instructblip-vicuna-7b image-to-text
kha-white/manga-ocr-base image-to-text
google/pix2struct-base image-to-text
microsoft/trocr-small-handwritten image-to-text
naver-clova-ix/donut-base-finetuned-cord-v2 image-to-text
laion/mscoco_finetuned_CoCa-ViT-L-14-laion2B-s13B-b90k image-to-text
HuggingFaceM4/idefics-9b image-to-text
naver-clova-ix/donut-base image-to-text
© 2023 BimAnt