ChefBERTo 👨‍🍳

chefberto-italian-cased is a BERT model obtained by MLM adaptive-tuning bert-base-italian-xxl-cased on Italian cooking recipes, approximately 50k sentences (2.6M words).

Author: Cristiano De Nobili (@denocris on Twitter, LinkedIn) for VINHOOD. <p> <img src="https://drive.google.com/uc?export=view&id=1u5aY2wKu-X5DAzbOq7rsgGFW5_lGUAQn" width="400"> </br> </p>

Perplexity

Test set: 9k sentences about food.

Model Perplexity
chefberto-italian-cased 1.84
bert-base-italian-xxl-cased 2.85

Usage

from transformers import AutoModel, AutoTokenizer
model_name = "vinhood/chefberto-italian-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)