This repository shares smaller version of bert-base-multilingual-uncased that keeps only Ukrainian, English, and Russian tokens in the vocabulary.
Model | Num parameters | Size |
---|---|---|
bert-base-multilingual-uncased | 167 million | ~650 MB |
MaxVortman/bert-base-ukr-eng-rus-uncased | 110 million | ~423 MB |