English2Dialect
This model translates English into colloquial Arabic, conditioning the output on a target dialect. Supported dialects are Palestinian/Jordanian, Syrian, Lebanese and Egyptian. The dialect is specified by the first input token, which must be one of P (Palestinian/Jordanian), S (Syrian), L (Lebanese) or E (Egyptian), followed by a space and the English text.
For example, to translate to Palestinian/Jordanian, use the input:
P What time is it now?
And for Syrian:
S What time is it now?
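The prefix convention above can be wrapped in a small helper. The following is a minimal sketch rather than an official usage snippet: the repository id `guymorlan/English2Dialect`, the helper names `build_input` and `translate`, the dialect keys in `DIALECT_PREFIXES` and the `max_new_tokens` value are all assumptions made for illustration. Only the P/S/L/E prefixes and the MarianMT model type come from this card.

```python
# Human-readable dialect names mapped to the single-letter prefix tokens
# described in this card. The key names are illustrative assumptions.
DIALECT_PREFIXES = {
    "palestinian_jordanian": "P",
    "syrian": "S",
    "lebanese": "L",
    "egyptian": "E",
}


def build_input(text: str, dialect: str) -> str:
    """Prepend the dialect token to an English sentence."""
    key = dialect.lower()
    if key not in DIALECT_PREFIXES:
        raise ValueError(
            f"Unknown dialect {dialect!r}; expected one of {sorted(DIALECT_PREFIXES)}"
        )
    return f"{DIALECT_PREFIXES[key]} {text.strip()}"


def translate(text: str, dialect: str,
              model_id: str = "guymorlan/English2Dialect") -> str:
    """Translate English text into the requested dialect.

    The default model_id is an assumption; replace it with the actual
    repository id of this model.
    """
    # Imported lazily so build_input works without transformers installed.
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(model_id)
    model = MarianMTModel.from_pretrained(model_id)

    # Tokenize the prefixed sentence and generate the translation.
    batch = tokenizer([build_input(text, dialect)],
                      return_tensors="pt", padding=True)
    generated = model.generate(**batch, max_new_tokens=64)
    return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]
```

Calling `translate("What time is it now?", "syrian")` would send `S What time is it now?` to the model. Keeping the prefix logic in its own function makes it easy to validate the dialect before loading any weights.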
- Demo: https://huggingface.co/spaces/guymorlan/English2Shami
- Version w/o conditional generation (Levantine only): https://huggingface.co/guymorlan/English2Shami
Training Data
The model was trained by fine-tuning the opus-mt-ar-en model (Modern Standard Arabic to English) on ~85K parallel sentences covering the four supported dialects of colloquial Arabic.
Model Description
- Developed by: Guy Mor-Lan (guy.mor@mail.huji.ac.il)
- Model type: MarianMT Seq2Seq
- License: MIT
- Finetuned from model: Helsinki-NLP/opus-mt-ar-en (https://huggingface.co/Helsinki-NLP/opus-mt-ar-en)