Test model for DL4NLP 2022 HW06
xtremedistil-l6-h256-uncased trained on SQuAD
Hyper parameters
- learning rate: 1e-5
- weight decay: 0.01
- warm up steps: 0
- learning rate scheduler: linear
- epochs: 1
Metric results on the dev set
- F1: 65.48
- EM: 51.67
xtremedistil-l6-h256-uncased trained on SQuAD