Training procedure

Fine-tuned version of Falcon-180B using PEFT LoRA + DeepSpeed ZeRO3 + Flash Attention + Activation Checkpointing. Read the blog Falcon 180B Finetuning using 🤗 PEFT and DeepSpeed for more information.

Framework versions