

gpt-expt-sp-v3-K-600-MA-Mac-actions-kmeans-v4

This model is a fine-tuned version of gpt2 on an unknown dataset. Final evaluation-set results are not listed in this card; the last validation loss logged during training is 0.0162 (step 115000, epoch 493.56).

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

More information needed

Training results

| Training Loss | Epoch  | Step   | Validation Loss |
|---------------|--------|--------|-----------------|
| 0.1593        | 21.46  | 5000   | 0.0853          |
| 0.0847        | 42.92  | 10000  | 0.1198          |
| 0.0406        | 64.38  | 15000  | 0.0613          |
| 0.0324        | 85.83  | 20000  | 0.0307          |
| 0.0238        | 107.3  | 25000  | 0.0211          |
| 0.0207        | 128.75 | 30000  | 0.0184          |
| 0.0193        | 150.21 | 35000  | 0.0176          |
| 0.0185        | 171.67 | 40000  | 0.0171          |
| 0.018         | 193.13 | 45000  | 0.0170          |
| 0.0177        | 214.59 | 50000  | 0.0167          |
| 0.0174        | 236.05 | 55000  | 0.0167          |
| 0.0172        | 257.51 | 60000  | 0.0166          |
| 0.017         | 278.97 | 65000  | 0.0165          |
| 0.0169        | 300.43 | 70000  | 0.0164          |
| 0.0168        | 321.89 | 75000  | 0.0164          |
| 0.0167        | 343.35 | 80000  | 0.0163          |
| 0.0166        | 364.8  | 85000  | 0.0163          |
| 0.0165        | 386.27 | 90000  | 0.0163          |
| 0.0164        | 407.72 | 95000  | 0.0162          |
| 0.0164        | 429.18 | 100000 | 0.0162          |
| 0.0163        | 450.64 | 105000 | 0.0162          |
| 0.0163        | 472.1  | 110000 | 0.0162          |
| 0.0163        | 493.56 | 115000 | 0.0162          |
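
The logged rows above can be summarized with a little arithmetic. The following is a minimal sketch, not part of the original training code: it copies the rows into a list and estimates the number of optimizer steps per epoch, the first logged step with the lowest validation loss, and the first logged step at which validation loss comes within a tolerance of its final value. The helper names (`steps_per_epoch`, `best_row`, `plateau_step`) and the tolerance of 0.001 are illustrative assumptions.

```python
# Rows copied from the training results table:
# (training loss, epoch, step, validation loss)
ROWS = [
    (0.1593, 21.46, 5000, 0.0853),
    (0.0847, 42.92, 10000, 0.1198),
    (0.0406, 64.38, 15000, 0.0613),
    (0.0324, 85.83, 20000, 0.0307),
    (0.0238, 107.3, 25000, 0.0211),
    (0.0207, 128.75, 30000, 0.0184),
    (0.0193, 150.21, 35000, 0.0176),
    (0.0185, 171.67, 40000, 0.0171),
    (0.018, 193.13, 45000, 0.0170),
    (0.0177, 214.59, 50000, 0.0167),
    (0.0174, 236.05, 55000, 0.0167),
    (0.0172, 257.51, 60000, 0.0166),
    (0.017, 278.97, 65000, 0.0165),
    (0.0169, 300.43, 70000, 0.0164),
    (0.0168, 321.89, 75000, 0.0164),
    (0.0167, 343.35, 80000, 0.0163),
    (0.0166, 364.8, 85000, 0.0163),
    (0.0165, 386.27, 90000, 0.0163),
    (0.0164, 407.72, 95000, 0.0162),
    (0.0164, 429.18, 100000, 0.0162),
    (0.0163, 450.64, 105000, 0.0162),
    (0.0163, 472.1, 110000, 0.0162),
    (0.0163, 493.56, 115000, 0.0162),
]


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row."""
    _, epoch, step, _ = rows[-1]
    return step / epoch


def best_row(rows):
    """Return the first logged row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def plateau_step(rows, tolerance=0.001):
    """Return the first logged step whose validation loss is within
    `tolerance` of the final logged validation loss (illustrative threshold)."""
    final_val = rows[-1][3]
    for _, _, step, val in rows:
        if val - final_val <= tolerance:
            return step
    return rows[-1][2]


if __name__ == "__main__":
    print("steps per epoch (approx.):", round(steps_per_epoch(ROWS)))
    print("first step with lowest validation loss:", best_row(ROWS)[2])
    print("first step within tolerance of final loss:", plateau_step(ROWS))
```

On these rows the estimate comes to roughly 233 steps per epoch, and validation loss changes very little over the later part of the run, which may be worth weighing if the training budget is revisited.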

Framework versions