Model Card for BERTIS Model

This README provides an overview of BERTIS, its purpose and usage.

BERTIS Model Description

BERTIS (BERT-based Image Schema) is a computational model designed to classify input texts into specific Image Schema classes. Image Schemas are cognitive patterns that play a fundamental role in shaping the way humans conceptualize and reason about various concepts present in language. BERTIS is built upon the BERT architecture. The model is fine-tuned using a specialized corpus created by Wachowiak et al.(2022).

BERTIS Image Schema Classes

The main purpose of BERTIS is to automatically categorize input texts into predefined Image Schema classes. BERTIS considers 14 distinct Image Schema classes, each capturing a specific cognitive pattern. These classes and their corresponding examples (taken from Wachowiak et al. (2022)) are listed below:

<!-- ### Model Sources [optional] --

Uses

<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. --> <!-- ### Direct Use This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. [More Information Needed] --> <!-- ### Downstream Use [optional] This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app [More Information Needed] --> <!-- ### Out-of-Scope Use This section addresses misuse, malicious use, and uses that the model will not work well for. [More Information Needed] --> <!-- ## Bias, Risks, and Limitations This section is meant to convey both technical and sociotechnical limitations. [More Information Needed] --> <!-- ### Recommendations This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. --> <!-- ## How to Get Started with the Model Use the code below to get started with the model.-->

Training and Testing Details

The data used for training, validating and testing BERTIS is found in: https://github.com/lwachowiak/Systematic-Analysis-of-Image-Schemas-through-Explainable-Multilingual-Language-Models/blob/main/Data/Image%20Schemas%20English%20and%20German.csv;

80% of the data were used for training BERTIS, 10% for validation, and 10% for testing it. <!-- ### Training Procedure This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. --> <!-- #### Preprocessing [optional] [More Information Needed]--> <!-- #### Training Hyperparameters

Speeds, Sizes, Times [optional]

--> <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. [More Information Needed]-->

Evaluation

Metrics

Results

Class Precision Recall F1-score
"CENTER-PERIPHERY" 0.98 0.94 0.96
"CONTACT" 1 1 1
"CONTAINMENT" 0.74 0.67 0.7
"COVERING" 1 1 1
"FORCE" 0.8 0.91 0.85
"LINK" 1 1 1
"OBJECT" 0.81 0.83 0.82
"PART-WHOLE" 1 1 1
"SCALE" 1 1 1
"SOURCE_PATH_GOAL" 0.81 0.74 0.77
"SPLITTING" 1 1 1
"SUBSTANCE" 1 1 1
"SUPPORT" 1 1 1
"VERTICALITY" 0.88 0.93 0.90

Overall Accuracy : 0.93

For all Image Schema classes, F1-score is between 0.77 and 1, indicating that BERTIS performs well in terms of both accurately predicting the correct Image Schema classes (precision) and capturing many correct Image Schema classes as possible (recall). Indeed these results are also reflected in the precision and recall scores for all Image Schema classes. We observe a high precision (between 0.74 and 1) for all classes, which indicates the number of correct predicted Image Schema classes. The recall scores are above 0.74 for most of the classes, with one Image Schema class having a recall score equal to 0.67. The overall accuracy of BERTIS with respect to all Image Schema classes is equal to 0.93, indicating a high accuracy and good performance in classifying the input text into the correct Image Schema classes. <!--

Technical Specifications [optional]

Model Architecture and Objective

Compute Infrastructure

Hardware

Software

Citation [optional]

BibTeX: APA:

Glossary [optional]

More Information [optional]

Model Card Authors [optional]

Model Card Contact

Mireille Fares: mf.upmc@gmail.com-->

Developed by: Mireille Fares

Language(s) (NLP): English, can generalize to other languages such as German

License: Academic Free License

Finetuned from model: BERT Base Cased model