generated_from_trainer

<!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->

flan-t5-large-da-multiwoz2.1_400

This model is a fine-tuned version of google/flan-t5-large on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

Training Loss Epoch Step Validation Loss Accuracy Num Gen Len
1.182 1.16 200 0.5116 29.9369 3689 13.7774
0.5435 2.33 400 0.4323 33.5836 3689 15.3494
0.4654 3.49 600 0.3987 35.5517 3689 15.91
0.4345 4.65 800 0.3823 37.6767 3689 15.8986
0.4054 5.81 1000 0.3725 38.6302 3689 15.19
0.3883 6.98 1200 0.3689 38.7642 3689 15.7896
0.3694 8.14 1400 0.3685 40.0178 3689 16.3613
0.3573 9.3 1600 0.3639 40.4155 3689 15.614
0.3465 10.47 1800 0.3633 40.8682 3689 15.9461
0.3404 11.63 2000 0.3631 40.4788 3689 16.0949
0.329 12.79 2200 0.3617 41.5163 3689 15.434
0.3176 13.95 2400 0.3662 41.2127 3689 15.8075
0.3129 15.12 2600 0.3665 41.2478 3689 15.5636
0.3153 16.28 2800 0.3648 41.0925 3689 15.6617
0.3073 17.44 3000 0.3644 41.276 3689 15.7137
0.3003 18.6 3200 0.3640 41.5248 3689 15.72
0.2988 19.77 3400 0.3651 41.4459 3689 15.7322

Framework versions