<span style="color:red">Important note</span> The inference in HuggingFace won't work, because there is a missing pre-processing step.