Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
---|---|---|---|---|---|---|
GerbilLab/Gerbil-A-32m | 32m | A-Class | 20 | 640M | 262K | 4.048700 |
Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
---|---|---|---|---|---|---|
GerbilLab/Gerbil-A-32m | 32m | A-Class | 20 | 640M | 262K | 4.048700 |