generated_from_trainer

<!-- This model card has been generated automatically according to the information the Trainer had access to. You should probably proofread and complete it, then remove this comment. -->

Llama-2-7b-hf-textworld_cooking_augmented_sft

This model is a fine-tuned version of meta-llama/Llama-2-7b-hf on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Training results

Training Loss Epoch Step Validation Loss
0.1205 0.1 814 0.1465
0.1053 0.2 1628 0.1362
0.1007 0.3 2442 0.1302
0.0937 0.4 3256 0.1299

Framework versions