| Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
|---|---|---|---|---|---|---|
| GerbilLab/Gerbil-A-3.3m | 3.3M | A-Class | 20 | 60M | 65.5k | 6.664400 |