WineBERTo 🍷🥂

wineberto-italian-cased is a BERT model obtained by MLM adaptive-tuning bert-base-italian-xxl-cased on Italian drink recipes and wine descriptions, approximately 77k sentences (3.3M words).

Author: Cristiano De Nobili (@denocris on Twitter, LinkedIn) for VINHOOD. <p> <img src="https://drive.google.com/uc?export=view&id=1dco9I9uzevP2V6oku1salIYcovUAeqWE" width="400"> </br> </p>

Perplexity

Test set: 14k sentences about wine.

Model Perplexity
wineberto-italian-cased 2.29
bert-base-italian-xxl-cased 4.60

Usage

from transformers import AutoModel, AutoTokenizer
model_name = "vinhood/wineberto-italian-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)