A demo BERT classification model trained on (part of) the Yelp dataset
Photo2Text model: ydshieh/vit-gpt2-coco-en
Expected / Standard Input:
[CLS] Business Name [SEP] Address [SEP] City [SEP] Photo2Text Outputs ...
Example:
[CLS] Paws The Cat Cafe [SEP] 10588 109 Street [SEP] Edmonton [SEP] A cup of coffee
Expected Output: 5
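
A minimal sketch of assembling the input string in the format shown above. The helper name `build_input` and its parameters are hypothetical and are not part of the model's code. It assumes a single caption string from the Photo2Text model, since the source does not say how multiple captions are combined.

```python
def build_input(name: str, address: str, city: str, caption: str) -> str:
    """Join the four fields into the '[CLS] ... [SEP] ...' layout shown above.

    Hypothetical helper for illustration only.
    """
    fields = [name, address, city, caption]
    # Strip stray whitespace so the separators are evenly spaced.
    return "[CLS] " + " [SEP] ".join(field.strip() for field in fields)


text = build_input(
    "Paws The Cat Cafe",
    "10588 109 Street",
    "Edmonton",
    "A cup of coffee",  # caption assumed to come from the Photo2Text model
)
print(text)
```

If this string is later passed to a Hugging Face tokenizer, note that such tokenizers usually add [CLS] and [SEP] themselves. Whether to pass add_special_tokens=False depends on how the model was trained, which is not stated here.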