This is a multilingual NER system trained with a Frustratingly Easy Domain Adaptation (FEDA) architecture. It is based on LaBSE and supports several tagsets, all of which use the IOBES format:
- Wikiann (LOC, PER, ORG)
- SlavNER 19/21 (EVT, LOC, ORG, PER, PRO)
- SlavNER 17 (LOC, MISC, ORG, PER)
- SSJ500k (LOC, MISC, ORG, PER)
- KPWr (EVT, LOC, ORG, PER, PRO)
- CNEC (LOC, ORG, MEDIA, ART, PER, TIME)
- Turku (DATE, EVT, LOC, ORG, PER, PRO, TIME)

PER: person, LOC: location, ORG: organization, EVT: event, PRO: product, MISC: miscellaneous, MEDIA: media, ART: artifact, TIME: time, DATE: date

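Because every tagset uses IOBES, each token label carries a position prefix (B = begin, I = inside, E = end, S = single-token entity, O = outside). The following is a minimal sketch of how such a tag sequence could be turned into entity spans. The function name `iobes_to_spans` and the example sentence are illustrative assumptions, not part of the model's code.

```python
from typing import List, Tuple


def iobes_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Convert IOBES tags into (entity_type, start, end) spans, end inclusive."""
    spans = []
    start = None      # index where the currently open entity began
    open_type = None  # entity type of the currently open entity
    for i, tag in enumerate(tags):
        # "B-PER" -> ("B", "-", "PER"); "O" -> ("O", "", "")
        prefix, _, etype = tag.partition("-")
        if prefix == "S":
            # Single-token entity
            spans.append((etype, i, i))
            start, open_type = None, None
        elif prefix == "B":
            # Open a new multi-token entity
            start, open_type = i, etype
        elif prefix == "I" and open_type == etype:
            # Still inside the open entity
            continue
        elif prefix == "E" and open_type == etype:
            # Close the open entity
            spans.append((etype, start, i))
            start, open_type = None, None
        else:
            # "O" or an inconsistent sequence: discard any open entity
            start, open_type = None, None
    return spans


# Hypothetical example sentence with hand-written tags
tokens = ["Angela", "Merkel", "visited", "Ljubljana", "."]
tags = ["B-PER", "E-PER", "O", "S-LOC", "O"]
entities = [(etype, " ".join(tokens[s:e + 1])) for etype, s, e in iobes_to_spans(tags)]
print(entities)  # [('PER', 'Angela Merkel'), ('LOC', 'Ljubljana')]
```

This sketch drops entities whose tags are inconsistent (for example an `I-` or `E-` tag with no matching `B-`); a more lenient decoder might try to repair them instead.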
You can select the tagset to use in the output by configuring the model.
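Selecting a tagset determines which entity types can appear in the output. As a rough illustration, the sketch below expands a tagset into its IOBES label inventory. The `TAGSETS` dictionary and the `iobes_labels` helper are hypothetical names used only for this example; they are not the model's configuration API, and the model's actual label order may differ.

```python
# Entity types per tagset, copied from the list in this README.
# The dictionary itself is an illustrative assumption, not a model artifact.
TAGSETS = {
    "Wikiann": ["LOC", "PER", "ORG"],
    "SlavNER 19/21": ["EVT", "LOC", "ORG", "PER", "PRO"],
    "SlavNER 17": ["LOC", "MISC", "ORG", "PER"],
    "SSJ500k": ["LOC", "MISC", "ORG", "PER"],
    "KPWr": ["EVT", "LOC", "ORG", "PER", "PRO"],
    "CNEC": ["LOC", "ORG", "MEDIA", "ART", "PER", "TIME"],
    "Turku": ["DATE", "EVT", "LOC", "ORG", "PER", "PRO", "TIME"],
}


def iobes_labels(tagset: str) -> list:
    """Return "O" plus B-/I-/E-/S- labels for every entity type in the tagset."""
    labels = ["O"]
    for etype in TAGSETS[tagset]:
        labels.extend(f"{prefix}-{etype}" for prefix in "BIES")
    return labels


print(iobes_labels("Wikiann"))
```

Under this scheme a tagset with n entity types yields 4n + 1 labels, so Wikiann has 13 and Turku has 29.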
More information about the model can be found in the paper (https://aclanthology.org/2021.bsnlp-1.12.pdf) and in the GitHub repository (https://github.com/EMBEDDIA/NER_FEDA).