Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
---|---|---|---|---|---|---|
GerbilLab/Gerbil-D-6.7m | 6.7m | D-Class | 142 | 951M | 131k | 4.8186 |
Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
---|---|---|---|---|---|---|
GerbilLab/Gerbil-D-6.7m | 6.7m | D-Class | 142 | 951M | 131k | 4.8186 |