generated_from_trainer

Overview

Model Description

FILL-MAsk-RoBERTa-base is a distilled version of the RoBERTa-base model, designed for the Masked Language Modeling task. This model follows a similar training procedure as DistilBERT, resulting in a smaller model with 6 layers, 768 dimensions, and 12 attention heads. It contains a total of 82 million parameters, making it more lightweight compared to the original RoBERTa-base, which has 125 million parameters. On average, DistilRoBERTa is approximately twice as fast as RoBERTa-base.

Usage

FILL-MAsk-RoBERTa-base can be used for both direct and downstream tasks. It is suitable for masked language modeling tasks, where tokens are masked, and the model must predict the masked tokens. It is also intended to be fine-tuned on downstream tasks such as sequence classification, token classification, or question answering. Users can explore the Hugging Face Model Hub to find fine-tuned versions of this model for specific tasks of interest.

Limitations

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="ayoubkirouane/FILL-MAsk-RoBERTa-base")
result = pipe("The capital of Algeria is <mask>.")
print(result)