Model Name Parameters Class Ratio Tokens Batch Size (Tokens) Training Loss
GerbilLab/Gerbil-A-32m 32m A-Class 20 640M 262K 4.048700