image-to-text - AI Model Zoo - BimAnt

Multimodal

Feature Extraction Text-to-Image Image-to-Text Text-to-Video Visual Question Answering Document Question Answering Graph Machine Learning

Computer Vision

Depth Estimation Image Classification Object Detection Image Segmentation Image-to-Image Unconditional Image Generation Video Classification Zero-Shot Image Classification

Natural Language Processing

Text Classification Token Classification Table Question Answering Question Answering Zero-Shot Classification Translation Summarization Conversational Text Generation Text2Text Generation Fill-Mask Sentence Similarity

Audio

Text-to-Speech Text-to-Audio Automatic Speech Recognition Audio-to-Audio Audio Classification Voice Activity Detection

Tabular

Tabular Classification Tabular Regression

Reinforcement Learning

Reinforcement Learning Robotics

Models with tag image-to-text retrieved: 311

nlpconnect/vit-gpt2-image-captioning image-to-text

Salesforce/blip-image-captioning-large image-to-text

Salesforce/blip-image-captioning-base image-to-text

microsoft/trocr-large-printed image-to-text

microsoft/trocr-base-handwritten image-to-text

microsoft/trocr-large-stage1 image-to-text

microsoft/trocr-base-printed image-to-text

Salesforce/blip2-opt-2.7b image-to-text

google/pix2struct-textcaps-base image-to-text

fxmarty/pix2struct-tiny-random image-to-text

microsoft/git-base image-to-text

Salesforce/blip2-flan-t5-xl image-to-text

Salesforce/instructblip-vicuna-7b image-to-text

kha-white/manga-ocr-base image-to-text

google/pix2struct-base image-to-text

microsoft/trocr-small-handwritten image-to-text

naver-clova-ix/donut-base-finetuned-cord-v2 image-to-text

laion/mscoco_finetuned_CoCa-ViT-L-14-laion2B-s13B-b90k image-to-text

HuggingFaceM4/idefics-9b image-to-text

naver-clova-ix/donut-base image-to-text