Efficient solutions for word reordering in German-English phrase-based statistical machine translation

Arianna Bisazza, Marcello Federico

Research output: Academic, peer-reviewed


Abstract

Despite being closely related languages, German and English are characterized by important word order differences. Long-range reordering of verbs, in particular, represents a real challenge for state-of-the-art SMT systems and is one of the main reasons why translation quality is often so poor in this language pair. In this work, we review several solutions to improve the accuracy of German-English word reordering while preserving the efficiency of phrase-based decoding. Among these, we consider a novel technique to dynamically shape the reordering search space and effectively capture long-range reordering phenomena. Through an extensive evaluation including diverse translation quality metrics, we show that these solutions can significantly narrow the gap between phrase-based and hierarchical SMT.
Original language: English
Title of host publication: Proceedings of the Eighth Workshop on Statistical Machine Translation
Publisher: Association for Computational Linguistics (ACL)
Pages: 440-451
Number of pages: 12
Status: Published - 2013
Externally published: Yes
Event: Workshop on Statistical Machine Translation - Sofia, Bulgaria
Duration: 8 Aug 2013 to 9 Aug 2013
Conference number: 8

Workshop

Workshop: Workshop on Statistical Machine Translation
Country/Region: Bulgaria
City: Sofia
Period: 08/08/2013 to 09/08/2013
