<table> <tr> <td style="width: 30%; text-align: left; vertical-align: middle">

CurtGPT

Using Microsoft's Phi 1.5 model like it was never intended.

</td> <td style="text-align: center;"> <img src="https://github.com/tim-a-davis/silly_little_language_modeling_thing_at_utd/blob/main/curtgpt%20logo.png?raw=true" width="300" height="auto"> </td> </tr> </table>

Main Procedure

This model is an adapter for Puffin Phi v2, trained with QLoRA and DPO on 60,000 samples from the Anthropic helpful-only dataset.
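For readers unfamiliar with DPO, here is a minimal sketch of the per-pair loss it optimizes. This is not the training code used for this adapter. The function name `dpo_loss`, its argument names, and the `beta=0.1` default are illustrative assumptions, and the inputs are assumed to be summed sequence log-probabilities from the trained policy and from a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for a single preference pair (hypothetical helper).

    Each argument is the summed log-probability of a full response.
    beta=0.1 is an assumed value, not one taken from this model's training run.
    """
    # How much more likely the policy makes each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin: positive when the policy favors the chosen response.
    margin = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(margin)), computed in a numerically stable way.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# The policy raises the chosen response and lowers the rejected one
# relative to the reference, so the loss falls below log(2).
loss = dpo_loss(-10.0, -20.0, -12.0, -15.0)
```

When the policy and the reference agree exactly, the margin is zero and the loss equals `log(2)`. Training pushes the margin up, which pushes the loss toward zero.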


library_name: peft

Training procedure

The following bitsandbytes quantization config was used during training:

Framework versions