Danish BERT for hate speech (offensive language) detection

The BERT HateSpeech model detects whether a Danish text is offensive or not. It is based on the pretrained Danish BERT model by BotXO which has been fine-tuned on social media data.

See the DaNLP documentation for more details.

Here is how to use the model:

from transformers import BertTokenizer, BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained("alexandrainst/da-hatespeech-detection-base")
tokenizer = BertTokenizer.from_pretrained("alexandrainst/da-hatespeech-detection-base")

Training data

The data used for training has not been made publicly available. It consists of social media data manually annotated in collaboration with Danmarks Radio.

Danish BERT for hate speech (offensive language) detection

Training data

NSDT 3DConvert

UnrealSynth

DreamTexture.js