Abstract
Translating to and from low-resource polysynthetic languages presents numerous challenges for NMT. We present the results of our systems for the English–Inuktitut language pair for the WMT 2020 translation tasks. We investigated the importance of correct morphological segmentation, whether adding data from a related language (Greenlandic) helps, and whether using contextual word embeddings improves translation. While each method showed some promise, the results are mixed.
Original language | English |
---|---|
Title | Proceedings of the Fifth Conference on Machine Translation (WMT) |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 274-281 |
Number of pages | 8 |
Status | Published - Nov 2020 |
Event | Fifth Conference on Machine Translation - Online. Duration: 19 Nov 2020 → 20 Nov 2020 |
Conference
Conference | Fifth Conference on Machine Translation |
---|---|
Abbreviated title | WMT20 |
Period | 19/11/2020 → 20/11/2020 |