bert-small2bert-small-finetuned-cnn_daily_mail-summarization-finetuned-multi_news

This model is a fine-tuned version of mrm8488/bert-small2bert-small-finetuned-cnn_daily_mail-summarization on the multi_news dataset. Its results on the evaluation set, measured at intervals during training, are listed under Training results.
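The card does not include a usage snippet, so the following is a minimal sketch of how a checkpoint of this kind is typically loaded with the Transformers library. It assumes the model is a BERT2BERT encoder-decoder, like the base checkpoint it was fine-tuned from, and that `transformers` and `torch` are installed. The `summarize` helper and the 512-token input limit are illustrative assumptions, not something this card states. The Hub repository id of the fine-tuned model is not given here, so the default below points at the base checkpoint; substitute the fine-tuned repository id when you know it.

```python
# Base checkpoint named in this card. The fine-tuned model's own repo id is
# not stated in the card, so this default is only a stand-in (assumption).
BASE_CHECKPOINT = "mrm8488/bert-small2bert-small-finetuned-cnn_daily_mail-summarization"


def summarize(text, checkpoint=BASE_CHECKPOINT, max_input_tokens=512):
    """Hypothetical helper: generate a summary with an encoder-decoder checkpoint."""
    # Imported lazily so this file can be loaded without the heavy dependencies.
    import torch
    from transformers import BertTokenizerFast, EncoderDecoderModel

    tokenizer = BertTokenizerFast.from_pretrained(checkpoint)
    model = EncoderDecoderModel.from_pretrained(checkpoint)
    model.eval()

    # Truncate long inputs; 512 tokens is an assumed limit for a BERT encoder.
    inputs = tokenizer(
        text,
        truncation=True,
        max_length=max_input_tokens,
        return_tensors="pt",
    )
    with torch.no_grad():
        output_ids = model.generate(
            inputs.input_ids,
            attention_mask=inputs.attention_mask,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # Downloads the checkpoint from the Hub on first use.
    print(summarize("Replace this string with the news text to summarize."))
```

Because multi_news inputs concatenate several source articles, most of them will exceed the encoder's input window and be truncated in this sketch.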

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

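| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
|---------------|-------|------|-----------------|---------|---------|---------|-----------|---------|
| 4.6946        | 0.89  | 400  | 4.5393          | 37.164  | 11.5191 | 20.2519 | 32.1568   | 126.415 |
| 4.5128        | 1.78  | 800  | 4.4185          | 38.2345 | 12.2053 | 20.954  | 33.0667   | 128.975 |
| 4.2926        | 2.67  | 1200 | 4.3866          | 38.4475 | 12.6488 | 21.3046 | 33.2768   | 129.0   |
| 4.231         | 3.56  | 1600 | 4.3808          | 38.7008 | 12.6323 | 21.307  | 33.3693   | 128.955 |
| 4.125         | 4.44  | 2000 | 4.3760          | 38.5318 | 12.7285 | 21.4358 | 33.4565   | 128.985 |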
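To make the table easier to interpret, here is a small sketch that reads the logged rows and answers two questions: which logged step scores best on each criterion, and roughly how many optimizer steps make up one epoch. The numbers are copied from the table above; the variable names and the idea of estimating steps per epoch from the Step/Epoch ratio are illustrative assumptions, since the card does not state the batch size or training-set size.

```python
# Rows copied from the training results table:
# (train_loss, epoch, step, val_loss, rouge1, rouge2, rougeL, rougeLsum, gen_len)
rows = [
    (4.6946, 0.89, 400, 4.5393, 37.164, 11.5191, 20.2519, 32.1568, 126.415),
    (4.5128, 1.78, 800, 4.4185, 38.2345, 12.2053, 20.954, 33.0667, 128.975),
    (4.2926, 2.67, 1200, 4.3866, 38.4475, 12.6488, 21.3046, 33.2768, 129.0),
    (4.231, 3.56, 1600, 4.3808, 38.7008, 12.6323, 21.307, 33.3693, 128.955),
    (4.125, 4.44, 2000, 4.3760, 38.5318, 12.7285, 21.4358, 33.4565, 128.985),
]

# Column indices, for readability.
EPOCH, STEP, VAL_LOSS, ROUGE1 = 1, 2, 3, 4

# Logged step with the lowest validation loss and with the highest Rouge1.
best_by_loss = min(rows, key=lambda r: r[VAL_LOSS])
best_by_rouge1 = max(rows, key=lambda r: r[ROUGE1])

# Rough estimate of steps per epoch: average of step / epoch over all rows.
# Epoch values are rounded to two decimals in the log, so this is approximate.
steps_per_epoch = sum(r[STEP] / r[EPOCH] for r in rows) / len(rows)

print("lowest validation loss at step", best_by_loss[STEP])
print("highest Rouge1 at step", best_by_rouge1[STEP])
print("approx. steps per epoch:", round(steps_per_epoch))
```

On these rows, validation loss is lowest at the final logged step (2000), while Rouge1 peaks one evaluation earlier (step 1600); the differences between the last three evaluations are small on every metric.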

Framework versions