generated_from_keras_callback

<!-- This model card has been generated automatically according to the information Keras had access to. You should probably proofread and complete it, then remove this comment. -->

gritsys/my_awesome_eli5_clm-model

This model is a fine-tuned version of distilgpt2 on an unknown dataset. It achieves the following results on the evaluation set at the final epoch (epoch 14): Train Loss 5.3399, Validation Loss 5.5886.

Model description

More information needed

Intended uses & limitations

More information needed
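
Since the card describes a distilgpt2 fine-tune produced by a Keras callback, one plausible way to try the model is through the TensorFlow classes in `transformers`. The following is a minimal sketch, not a documented usage recipe: it assumes the repository holds TensorFlow weights, that a `transformers` version with TensorFlow support is installed, and that the base `distilgpt2` tokenizer is the right one to pair with it. The helper name `generate_text` is hypothetical.

```python
MODEL_ID = "gritsys/my_awesome_eli5_clm-model"  # repository name from this card
BASE_TOKENIZER_ID = "distilgpt2"  # assumption: tokenizer of the base model


def generate_text(prompt: str, max_new_tokens: int = 40) -> str:
    """Generate a continuation of `prompt` (hypothetical helper).

    Imports are done lazily so that defining this function does not
    require TensorFlow or network access.
    """
    from transformers import AutoTokenizer, TFAutoModelForCausalLM

    # Assumption: the repository contains TensorFlow weights, as a model
    # pushed from a Keras callback normally would.
    tokenizer = AutoTokenizer.from_pretrained(BASE_TOKENIZER_ID)
    model = TFAutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prompt, return_tensors="tf")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("Why is the sky blue?")` would download the weights on first use. Output quality is not documented in this card, so treat any generated text as untested.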

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

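| Train Loss | Validation Loss | Epoch |
|:----------:|:---------------:|:-----:|
| 6.7702     | 6.4295          | 0     |
| 6.3075     | 6.2404          | 1     |
| 6.1358     | 6.1114          | 2     |
| 6.0137     | 6.0240          | 3     |
| 5.9162     | 5.9632          | 4     |
| 5.8324     | 5.8999          | 5     |
| 5.7573     | 5.8411          | 6     |
| 5.6913     | 5.7984          | 7     |
| 5.6306     | 5.7603          | 8     |
| 5.5742     | 5.7290          | 9     |
| 5.5219     | 5.6919          | 10    |
| 5.4724     | 5.6651          | 11    |
| 5.4264     | 5.6356          | 12    |
| 5.3815     | 5.6159          | 13    |
| 5.3399     | 5.5886          | 14    |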
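
The losses above are easier to interpret as perplexities. The sketch below assumes the reported values are mean per-token cross-entropy in nats (the usual convention for causal language modeling, but not stated in this card), so that perplexity is `exp(loss)`. It also reports the gap between validation and train loss per epoch. The names `HISTORY`, `perplexity`, and `summarize` are illustrative only.

```python
import math

# (epoch, train_loss, validation_loss), copied from the training results.
HISTORY = [
    (0, 6.7702, 6.4295), (1, 6.3075, 6.2404), (2, 6.1358, 6.1114),
    (3, 6.0137, 6.0240), (4, 5.9162, 5.9632), (5, 5.8324, 5.8999),
    (6, 5.7573, 5.8411), (7, 5.6913, 5.7984), (8, 5.6306, 5.7603),
    (9, 5.5742, 5.7290), (10, 5.5219, 5.6919), (11, 5.4724, 5.6651),
    (12, 5.4264, 5.6356), (13, 5.3815, 5.6159), (14, 5.3399, 5.5886),
]


def perplexity(loss: float) -> float:
    """Convert a cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


def summarize(history):
    """Return per-epoch validation perplexity and validation-minus-train gap."""
    return [
        {
            "epoch": epoch,
            "val_perplexity": perplexity(val_loss),
            "gap": val_loss - train_loss,
        }
        for epoch, train_loss, val_loss in history
    ]


for row in summarize(HISTORY):
    print(f"epoch {row['epoch']:>2}: "
          f"val perplexity {row['val_perplexity']:.1f}, "
          f"val - train loss {row['gap']:+.4f}")
```

Under that assumption, the gap starts negative (validation loss below train loss at epoch 0) and turns positive by epoch 3, while validation loss is still decreasing at epoch 14.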

Framework versions