nli

Eval results

We obtain the following results on the validation and test sets:

| Set        | F1<sub>micro</sub> | F1<sub>macro</sub> |
|------------|--------------------|--------------------|
| validation | 89.2               | 87.6               |
| test       | 88.9               | 87.4               |
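
For readers unfamiliar with the two averages reported above, here is a minimal sketch of how micro- and macro-averaged F1 differ. It is not the evaluation script used for this model: the helper `f1_scores`, the label names, and the toy predictions are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def f1_scores(y_true, y_pred):
    """Return (micro_f1, macro_f1) for single-label multi-class predictions."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for gold, pred in zip(y_true, y_pred):
        if gold == pred:
            tp[gold] += 1
        else:
            fp[pred] += 1  # predicted class gains a false positive
            fn[gold] += 1  # gold class gains a false negative

    def f1(tp_, fp_, fn_):
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    # Micro: pool the counts over all classes, then compute one F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: compute F1 per class, then take the unweighted mean.
    macro = sum(f1(tp[l], fp[l], fn[l]) for l in labels) / len(labels)
    return micro, macro


# Toy data (assumed label set), not drawn from the evaluation sets above.
y_true = ["entailment", "entailment", "neutral", "contradiction", "neutral", "entailment"]
y_pred = ["entailment", "neutral", "neutral", "contradiction", "neutral", "entailment"]

micro, macro = f1_scores(y_true, y_pred)
print(f"micro F1 = {micro:.3f}, macro F1 = {macro:.3f}")
```

In the single-label setting, micro-averaged F1 equals accuracy, while macro-averaged F1 weights every class equally regardless of how often it occurs, so the two diverge when classes are imbalanced or when some classes are harder than others.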