Marian-MT-BBC-PCM-EN
-
source language: pcm (Nigerian Pidgin)
-
target language: en (English)
-
dataset:
- Parallel Sentences from the message translation (English) and Pidgin translation of the Bible.
- Pidgin sentences from BBC Pidgin and English translation by GPT3.5-turbo
-
model: transformer-align
-
pre-processing: normalization + SentencePiece
Performance
TBA