generated_from_trainer

<!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->

Magical2

This model is a fine-tuned version of crumb/gpt-joke on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

Training Loss Epoch Step Validation Loss
No log 1.0 33 1.6553
No log 2.0 66 1.5406
No log 3.0 99 1.4875
No log 4.0 132 1.4571
No log 5.0 165 1.4472
No log 6.0 198 1.4450
No log 7.0 231 1.4522
No log 8.0 264 1.4694
No log 9.0 297 1.4754
No log 10.0 330 1.4947
No log 11.0 363 1.5067
No log 12.0 396 1.5227
No log 13.0 429 1.5341
No log 14.0 462 1.5436
No log 15.0 495 1.5533

Framework versions