Training procedure

Framework versions

Main results

| Average | ARC   | HellaSwag | MMLU  | TruthfulQA |
|---------|-------|-----------|-------|------------|
| 60.21   | 59.56 | 82.39     | 55.47 | 43.4       |
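
The reported Average is consistent with an unweighted mean of the four benchmark scores. The sketch below checks that arithmetic, assuming equal weighting and half-up rounding to two decimals; neither is stated in the source, and the variable names are illustrative only.

```python
from decimal import Decimal, ROUND_HALF_UP

# Benchmark scores copied from the results above.
scores = {
    "ARC": Decimal("59.56"),
    "HellaSwag": Decimal("82.39"),
    "MMLU": Decimal("55.47"),
    "TruthfulQA": Decimal("43.4"),
}

# Assumption: "Average" is the unweighted mean of the four scores.
mean = sum(scores.values()) / len(scores)

# Assumption: the mean is rounded half-up to two decimal places.
average = mean.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

print(average)  # prints 60.21
```

Decimal arithmetic is used here rather than floats so that the exact mean, 60.205, rounds deterministically.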