generated_from_trainer

<!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->

jazz_clm-model

This model was trained from scratch on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

Training Loss Epoch Step Validation Loss
No log 1.0 365 0.6431
0.6104 2.0 730 0.6238
0.5859 3.0 1095 0.6103
0.5859 4.0 1460 0.6046
0.5724 5.0 1825 0.5941
0.5574 6.0 2190 0.5930
0.5478 7.0 2555 0.5841
0.5478 8.0 2920 0.5821
0.5395 9.0 3285 0.5810
0.527 10.0 3650 0.5755
0.5237 11.0 4015 0.5728
0.5237 12.0 4380 0.5691
0.5147 13.0 4745 0.5733
0.5141 14.0 5110 0.5680
0.5141 15.0 5475 0.5657
0.5076 16.0 5840 0.5719
0.5004 17.0 6205 0.5691
0.4985 18.0 6570 0.5687
0.4985 19.0 6935 0.5679
0.4926 20.0 7300 0.5689
0.4914 21.0 7665 0.5667
0.4885 22.0 8030 0.5631
0.4885 23.0 8395 0.5666
0.4862 24.0 8760 0.5668
0.4825 25.0 9125 0.5643
0.4825 26.0 9490 0.5652
0.4828 27.0 9855 0.5667
0.48 28.0 10220 0.5657
0.4775 29.0 10585 0.5654
0.4775 30.0 10950 0.5660

Framework versions