xlm-roberta-large-meddocan
This model is a finetuned version of xlm-roberta-large for the meddocan dataset used in a benchmark in the paper TODO. The model has a F1 of 0.978
Please refer to the original publication for more information TODO LINK
Parameters used
parameter | Value |
---|---|
batch size | 16 |
learning rate | 4e-05 |
classifier dropout | 0.2 |
warmup ratio | 0 |
warmup steps | 0 |
weight decay | 0 |
optimizer | AdamW |
epochs | 10 |
early stopping patience | 3 |
BibTeX entry and citation info
TODO