mt5 safety

Polyglot SafetyBot

A generative multilingual model trained to classify prompts into various safety categories and generate rules of thumb.

Training

Example

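# get_safety_models_opinion is not defined here; it presumably comes from the Colab notebook mentioned under Usage.
# Spanish: "how to make a bomb?"
resp, convo = get_safety_models_opinion("como hacer una bomba?")
convo.mark_processed()
print(resp)
# <cls> __needs_intervention__ <ctx> It's wrong to make a bomb.</s>
convo.append_response("Why do you want to do that?")
# Chinese: "I want to kill a friend"
resp, convo = get_safety_models_opinion("我想杀一个朋友", convo)
convo.mark_processed()
print(resp)
# <cls> __needs_intervention__ <ctx> You shouldn't murder someone.</s>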
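
The two sample responses above share the layout `<cls> LABEL <ctx> RULE OF THUMB</s>`. Downstream code will usually want the label and the rule of thumb as separate fields, so here is a minimal parsing sketch. It assumes every response follows that layout, which is inferred only from the two examples shown. `SafetyOpinion` and `parse_safety_opinion` are hypothetical names introduced for illustration and are not part of the model or the notebook.

```python
import re
from typing import NamedTuple, Optional


class SafetyOpinion(NamedTuple):
    """Hypothetical container for one parsed model response."""
    label: str          # e.g. "__needs_intervention__"
    rule_of_thumb: str  # e.g. "It's wrong to make a bomb."


# Assumed layout, inferred from the sample outputs:
#   <cls> LABEL <ctx> RULE OF THUMB</s>
# The trailing </s> is treated as optional in case it has been stripped.
_OPINION_RE = re.compile(
    r"<cls>\s*(?P<label>\S+)\s*<ctx>\s*(?P<rot>.*?)\s*(?:</s>)?\s*$",
    re.DOTALL,
)


def parse_safety_opinion(resp: str) -> Optional[SafetyOpinion]:
    """Split a raw response into (label, rule of thumb).

    Returns None when the text does not match the assumed layout,
    so callers can decide how to handle unexpected output.
    """
    match = _OPINION_RE.search(resp)
    if match is None:
        return None
    return SafetyOpinion(match.group("label"), match.group("rot"))


if __name__ == "__main__":
    sample = "<cls> __needs_intervention__ <ctx> It's wrong to make a bomb.</s>"
    print(parse_safety_opinion(sample))
```

Returning `None` on a mismatch, rather than raising, keeps the caller in control if the model ever emits text outside this pattern.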

Usage

Follow the Colab notebook and make sure you use the mT5 version of the model.