WineBERTo 🍷🥂
wineberto-italian-cased is a BERT model obtained by MLM adaptive-tuning bert-base-italian-xxl-cased on Italian drink recipes and wine descriptions, approximately 77k sentences (3.3M words).
Author: Cristiano De Nobili (@denocris on Twitter, LinkedIn) for VINHOOD. <p> <img src="https://drive.google.com/uc?export=view&id=1dco9I9uzevP2V6oku1salIYcovUAeqWE" width="400"> </br> </p>
Perplexity
Test set: 14k sentences about wine.
Model | Perplexity |
---|---|
wineberto-italian-cased | 2.29 |
bert-base-italian-xxl-cased | 4.60 |
Usage
from transformers import AutoModel, AutoTokenizer
model_name = "vinhood/wineberto-italian-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)