autotrain vision image-classification

Model Description:

This model embodies a Vision Transformer (ViT) architecture tailored for image classification tasks. It's honed to accurately categorize images into three specific classes: Button, RadioButton, and CheckBox.

Training Data:

It's trained on a dataset comprising images from the three classes—Button, RadioButton, and CheckBox—enabling it to adeptly recognize and classify these distinct visual elements.

Model Trained Using AutoTrain

Validation Metrics