Trained for 700 epochs on the trimpixel dataset.

Training procedure

The following bitsandbytes quantization config was used during training:
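
The values themselves are not listed in this card. For orientation, a bitsandbytes quantization config is usually reported as a set of key/value settings, and the sketch below shows that shape in plain Python. The keys are parameter names of the `BitsAndBytesConfig` class in transformers, but every value is a placeholder assumption, not the configuration used for this model, and `render_config` is a hypothetical helper introduced only for illustration.

```python
# Illustrative only: placeholder values, NOT the settings used to train this model.
example_quant_config = {
    "load_in_8bit": False,
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_use_double_quant": True,
    "bnb_4bit_compute_dtype": "float16",
}


def render_config(config):
    """Format a config mapping as a 'key: value' bullet list (hypothetical helper)."""
    return [f"- {key}: {value}" for key, value in config.items()]


# Print the placeholder config one setting per line.
for line in render_config(example_quant_config):
    print(line)
```

Keeping the settings in a single mapping makes it straightforward to record the real values next to the checkpoint once they are known.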

Framework versions