generated_from_trainer

<!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->

20230823035201

This model is a fine-tuned version of bert-large-cased on the super_glue dataset. It achieves the following results on the evaluation set:

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

Training Loss Epoch Step Validation Loss Accuracy
No log 1.0 312 0.0745 0.5379
0.1506 2.0 624 0.0870 0.5379
0.1506 3.0 936 0.1588 0.4729
0.1116 4.0 1248 0.1264 0.5271
0.1028 5.0 1560 0.0727 0.4729
0.1028 6.0 1872 0.0738 0.5632
0.0952 7.0 2184 0.0757 0.4729
0.0952 8.0 2496 0.0780 0.5271
0.0812 9.0 2808 0.0723 0.5523
0.08 10.0 3120 0.0704 0.4729
0.08 11.0 3432 0.0712 0.4729
0.0806 12.0 3744 0.0701 0.4982
0.0789 13.0 4056 0.0712 0.4729
0.0789 14.0 4368 0.0700 0.5090
0.0785 15.0 4680 0.0702 0.5596
0.0785 16.0 4992 0.0721 0.4729
0.0797 17.0 5304 0.0701 0.4946
0.0789 18.0 5616 0.0722 0.4729
0.0789 19.0 5928 0.0704 0.4801
0.0789 20.0 6240 0.0713 0.6173
0.0787 21.0 6552 0.0703 0.5090
0.0787 22.0 6864 0.0701 0.4838
0.0789 23.0 7176 0.0699 0.4946
0.0789 24.0 7488 0.0700 0.4874
0.0789 25.0 7800 0.0708 0.5560
0.0782 26.0 8112 0.0704 0.4765
0.0782 27.0 8424 0.0711 0.4729
0.0783 28.0 8736 0.0702 0.5523
0.0784 29.0 9048 0.0715 0.4657
0.0784 30.0 9360 0.0726 0.6209
0.0784 31.0 9672 0.0702 0.5921
0.0784 32.0 9984 0.0708 0.4765
0.0782 33.0 10296 0.0705 0.4729
0.0781 34.0 10608 0.0700 0.4874
0.0781 35.0 10920 0.0702 0.4765
0.0778 36.0 11232 0.0706 0.4729
0.0783 37.0 11544 0.0700 0.5776
0.0783 38.0 11856 0.0709 0.4765
0.0776 39.0 12168 0.0699 0.4801
0.0776 40.0 12480 0.0699 0.5921
0.0771 41.0 12792 0.0698 0.5632
0.0772 42.0 13104 0.0699 0.5884
0.0772 43.0 13416 0.0700 0.6282
0.0771 44.0 13728 0.0699 0.4874
0.0772 45.0 14040 0.0700 0.6318
0.0772 46.0 14352 0.0697 0.4982
0.0768 47.0 14664 0.0699 0.4874
0.0768 48.0 14976 0.0696 0.5704
0.0768 49.0 15288 0.0696 0.6282
0.0769 50.0 15600 0.0699 0.4838
0.0769 51.0 15912 0.0697 0.5162
0.0773 52.0 16224 0.0701 0.4765
0.0767 53.0 16536 0.0696 0.5199
0.0767 54.0 16848 0.0698 0.6606
0.0763 55.0 17160 0.0703 0.4874
0.0763 56.0 17472 0.0696 0.6101
0.0763 57.0 17784 0.0695 0.5776
0.0765 58.0 18096 0.0697 0.5126
0.0765 59.0 18408 0.0695 0.5451
0.0765 60.0 18720 0.0695 0.5704

Framework versions