grizzled-interest-2023-03-29

This model is a fine-tuned version of facebook/mbart-large-cc25 on the mtc/newsum2021 dataset. It achieves the following results on the test set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
3.6815	0.89	500	3.5617	29.5414	10.5201	20.056	27.2581	86.07
3.4132	1.79	1000	3.4133	29.6393	9.9545	19.6903	27.0861	96.545
3.198	2.68	1500	3.3693	29.8614	10.4517	20.1728	27.3879	94.31
3.0292	3.58	2000	3.3370	30.6444	11.5935	21.1955	28.2699	87.355
2.901	4.47	2500	3.3440	30.7453	11.111	21.2076	28.269	88.365
2.7832	5.37	3000	3.3758	30.4995	10.9025	20.6601	28.0575	104.655
2.6965	6.26	3500	3.3793	31.2287	11.5544	21.1909	28.738	88.47
2.6475	7.16	4000	3.4083	32.0341	11.9417	22.2785	29.2495	84.095
2.6196	8.05	4500	3.4007	30.8963	11.3811	21.3146	28.3222	90.875
2.5574	8.94	5000	3.4104	32.3867	12.0469	21.9831	29.5205	87.46
2.4977	9.84	5500	3.4340	32.5857	12.5072	22.6288	30.1168	79.87
2.4362	10.73	6000	3.4626	31.9121	11.8577	22.3647	29.3822	85.17
2.3977	11.63	6500	3.4737	32.0202	12.0413	22.5237	29.5166	77.905
2.369	12.52	7000	3.4890	31.2516	11.3416	21.5711	28.5465	85.605
2.3446	13.42	7500	3.4949	32.1277	11.6876	22.0244	29.2239	83.895
2.3295	14.31	8000	3.4976	31.8729	11.629	21.9629	28.9948	84.47