XLM_R_Galen-caresA
This model is a fine-tuned version of XLM_R_Galen for the Cares Area dataset, used in a benchmark in the paper TODO. The model achieves an F1 score of 0.989.
Please refer to the original publication for more information: TODO LINK
Parameters used
| Parameter | Value |
|---|---|
| batch size | 16 |
| learning rate | 4e-05 |
| classifier dropout | 0.1 |
| warmup ratio | 0 |
| warmup steps | 0 |
| weight decay | 0 |
| optimizer | AdamW |
| epochs | 10 |
| early stopping patience | 3 |
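For anyone trying to reproduce the run, the table above can be captured as a small configuration object. This is a minimal sketch, not the authors' training script: the card does not say which training framework was used, so `FineTuneConfig` and `to_trainer_kwargs` are hypothetical names, and the mapping to Hugging Face `TrainingArguments`-style keywords (including treating the batch size as per-device) is an assumption.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class FineTuneConfig:
    """Hyperparameters copied from the table above (hypothetical container)."""
    batch_size: int = 16
    learning_rate: float = 4e-05
    classifier_dropout: float = 0.1
    warmup_ratio: float = 0.0
    warmup_steps: int = 0
    weight_decay: float = 0.0
    optimizer: str = "AdamW"
    epochs: int = 10
    early_stopping_patience: int = 3


def to_trainer_kwargs(cfg: FineTuneConfig) -> dict:
    """Map the table onto TrainingArguments-style keyword names.

    Assumption: the batch size is per device. The card does not say
    whether 16 is a per-device or a global batch size.
    """
    return {
        "per_device_train_batch_size": cfg.batch_size,
        "learning_rate": cfg.learning_rate,
        "warmup_ratio": cfg.warmup_ratio,
        "warmup_steps": cfg.warmup_steps,
        "weight_decay": cfg.weight_decay,
        "num_train_epochs": cfg.epochs,
    }


config = FineTuneConfig()
trainer_kwargs = to_trainer_kwargs(config)

# Print every value so the configuration can be logged next to the results.
for name, value in asdict(config).items():
    print(f"{name}: {value}")
```

The classifier dropout, optimizer, and early stopping patience are left out of `trainer_kwargs` on purpose: in a typical setup they belong to the model configuration and to a training callback rather than to the training arguments, but how the original run wired them is not stated here.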
BibTeX entry and citation info
TODO