HindBERT-Scratch

HindBERT is a Hindi BERT model. It is a base-BERT model trained from scratch on publicly available Hindi monolingual datasets. [project link] (https://github.com/l3cube-pune/MarathiNLP)

More details on the dataset, models, and baseline results can be found in our [paper] (<a href='https://arxiv.org/abs/2211.11418'> link </a>)

The best version of model is shared <a href='https://huggingface.co/l3cube-pune/hindi-bert-v2'> here </a>

Citing:

@article{joshi2022l3cubehind,
author = {Joshi, Raviraj},
year = {2022},
month = {09},
pages = {},
title = {L3Cube-HindBERT and DevBERT: Pre-Trained BERT Transformer models for Devanagari based Hindi and Marathi Languages},
doi = {10.13140/RG.2.2.14606.84809}
}

Other Models trained from scratch are listed below: <br> <a href='https://huggingface.co/l3cube-pune/marathi-bert-scratch'> Marathi-Scratch </a> <br> <a href='https://huggingface.co/l3cube-pune/marathi-tweets-bert-scratch'> Marathi-Tweets-Scratch </a> <br> <a href='https://huggingface.co/l3cube-pune/hindi-bert-scratch'> Hindi-Scratch </a> <br> <a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-bert-scratch'> Dev-Scratch </a> <br> <a href='https://huggingface.co/l3cube-pune/kannada-bert-scratch'> Kannada-Scratch </a> <br> <a href='https://huggingface.co/l3cube-pune/telugu-bert-scratch'> Telugu-Scratch </a> <br> <a href='https://huggingface.co/l3cube-pune/malayalam-bert-scratch'> Malayalam-Scratch </a> <br> <a href='https://huggingface.co/l3cube-pune/gujarati-bert-scratch'> Gujarati-Scratch </a> <br>

Better versions of Monolingual Indic BERT models are listed below: <br> <a href='https://huggingface.co/l3cube-pune/marathi-bert-v2'> Marathi BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/marathi-roberta'> Marathi RoBERTa </a> <br> <a href='https://huggingface.co/l3cube-pune/marathi-albert'> Marathi AlBERT </a> <br>

<a href='https://huggingface.co/l3cube-pune/hindi-bert-v2'> Hindi BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/hindi-roberta'> Hindi RoBERTa </a> <br> <a href='https://huggingface.co/l3cube-pune/hindi-albert'> Hindi AlBERT </a> <br>

<a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-bert'> Dev BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-roberta'> Dev RoBERTa </a> <br> <a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-albert'> Dev AlBERT </a> <br>

<a href='https://huggingface.co/l3cube-pune/kannada-bert'> Kannada BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/telugu-bert'> Telugu BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/malayalam-bert'> Malayalam BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/tamil-bert'> Tamil BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/gujarati-bert'> Gujarati BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/odia-bert'> Oriya BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/bengali-bert'> Bengali BERT </a> <br> <a href='https://huggingface.co/l3cube-pune/punjabi-bert'> Punjabi BERT </a> <br>