This model was pretrained on MS MARCO following the approach described in the paper "Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval". It can be used to reproduce the experimental results in the GitHub repository.

This model uses BERT-large as its backbone and has 335M parameters.
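
As a rough sanity check on the 335M figure, the sketch below counts the weights of a BERT-large encoder from its architecture. The dimensions are assumptions taken from the standard BERT-large uncased configuration (vocabulary of 30522 tokens, hidden size 1024, 24 layers, feed-forward size 4096, 512 positions, 2 segment types), not values read from this checkpoint, and `bert_param_count` is a hypothetical helper written for illustration.

```python
def bert_param_count(vocab_size=30522, hidden=1024, layers=24,
                     intermediate=4096, max_positions=512, type_vocab=2):
    """Count weights and biases of a BERT encoder with a pooler head.

    Defaults are the assumed standard BERT-large uncased dimensions.
    """
    layer_norm = 2 * hidden  # scale and shift vectors

    # Token, position, and segment embeddings, followed by a LayerNorm.
    embeddings = (vocab_size + max_positions + type_vocab) * hidden + layer_norm

    # Self-attention: query, key, value, and output projections, plus LayerNorm.
    attention = 4 * (hidden * hidden + hidden) + layer_norm

    # Feed-forward block: up-projection, down-projection, plus LayerNorm.
    feed_forward = (
        (hidden * intermediate + intermediate)
        + (intermediate * hidden + hidden)
        + layer_norm
    )

    # Pooler: one dense layer applied to the [CLS] representation.
    pooler = hidden * hidden + hidden

    return embeddings + layers * (attention + feed_forward) + pooler


total = bert_param_count()
print(f"{total:,} parameters (~{total / 1e6:.0f}M)")
# prints "335,141,888 parameters (~335M)"
```

Under these assumptions the count rounds to 335M. A checkpoint with a different vocabulary size or additional heads would give a slightly different total.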