Trained for 5 epochs on https://huggingface.co/datasets/totally-not-an-llm/EverythingLM-data. Merged model can be found here: https://huggingface.co/totally-not-an-llm/EverythingLM-13b-16k.

Training procedure

The following bitsandbytes quantization config was used during training:

Framework versions