generated_from_trainer

<!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->

flan-t5-large-da-multiwoz2.0_400-ep10-nonstop

This model is a fine-tuned version of google/flan-t5-large on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

Training Loss Epoch Step Validation Loss Accuracy Num Gen Len
1.3288 0.58 200 0.5718 27.6241 7358 13.9315
0.5941 1.17 400 0.4745 32.0484 7358 14.2517
0.5213 1.75 600 0.4283 33.6716 7358 15.1218
0.4891 2.33 800 0.4063 34.1115 7358 14.8005
0.4503 2.92 1000 0.3935 35.9067 7358 15.9095
0.4213 3.5 1200 0.3833 37.2591 7358 16.0005
0.4184 4.08 1400 0.3795 37.696 7358 15.478
0.3959 4.66 1600 0.3762 37.401 7358 14.8752
0.3847 5.25 1800 0.3714 37.7347 7358 15.9304
0.3779 5.83 2000 0.3710 38.6814 7358 14.9257
0.3776 6.41 2200 0.3681 38.4266 7358 15.2517
0.3601 7.0 2400 0.3669 38.6749 7358 15.1791
0.3504 7.58 2600 0.3669 39.2748 7358 15.4308
0.3568 8.16 2800 0.3650 39.798 7358 15.8966
0.3528 8.75 3000 0.3630 39.912 7358 15.4081
0.3463 9.33 3200 0.3641 40.1243 7358 15.5367
0.3439 9.91 3400 0.3642 40.0567 7358 15.3842

Framework versions