roberta-large-bne-ehealth_kd
This model is a fine-tuned version of roberta-large-bne for the eHealth-KD dataset, used in a benchmark in the paper TODO. The model achieves an F1 score of 0.836.
Please refer to the original publication for more information: TODO LINK
Parameters used
| Parameter | Value |
|---|---|
| batch size | 16 | 
| learning rate | 2e-05 | 
| classifier dropout | 0 | 
| warmup ratio | 0 | 
| warmup steps | 0 | 
| weight decay | 0 | 
| optimizer | AdamW | 
| epochs | 10 | 
| early stopping patience | 3 | 
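For anyone trying to reproduce a similar setup, the table above can be written down as a small configuration object. The sketch below is not the authors' training script. It records the values from the table and maps them onto keyword names accepted by `transformers.TrainingArguments`. The class `FineTuneConfig` and the helper `to_training_kwargs` are hypothetical names introduced here. Treating the batch size as per-device and picking `adamw_torch` as the AdamW variant are assumptions, since the card does not specify either.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FineTuneConfig:
    """Hyperparameters copied from the table above (hypothetical container)."""
    batch_size: int = 16
    learning_rate: float = 2e-5
    classifier_dropout: float = 0.0
    warmup_ratio: float = 0.0
    warmup_steps: int = 0
    weight_decay: float = 0.0
    optimizer: str = "AdamW"
    epochs: int = 10
    early_stopping_patience: int = 3


def to_training_kwargs(cfg: FineTuneConfig) -> dict:
    """Map the table values to transformers.TrainingArguments keyword names."""
    return {
        # Assumption: the card's batch size is interpreted as per-device.
        "per_device_train_batch_size": cfg.batch_size,
        "learning_rate": cfg.learning_rate,
        "warmup_ratio": cfg.warmup_ratio,
        "warmup_steps": cfg.warmup_steps,
        "weight_decay": cfg.weight_decay,
        # Assumption: the PyTorch AdamW implementation was used.
        "optim": "adamw_torch",
        "num_train_epochs": cfg.epochs,
    }


cfg = FineTuneConfig()
training_kwargs = to_training_kwargs(cfg)

# With transformers installed, these values could be passed on as, e.g.:
#   TrainingArguments(output_dir="out", **training_kwargs)
#   EarlyStoppingCallback(early_stopping_patience=cfg.early_stopping_patience)
# The classifier dropout belongs to the model config (classifier_dropout),
# not to TrainingArguments.
```

Note that early stopping in `transformers` also needs periodic evaluation and a metric for selecting the best model. Those settings are not listed in the table, so they are left out here.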
BibTeX entry and citation info
TODO