Efficient solutions for word reordering in German-English phrase-based statistical machine translation

Arianna Bisazza, Marcello Federico

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic › peer-review

Abstract

Despite being closely related languages, German and English are characterized by important word order differences. Long-range reordering of verbs, in particular, represents a real challenge for state-of-the-art SMT systems and is one of the main reasons why translation quality is often so poor in this language pair. In this work, we review several solutions to improve the accuracy of German-English word reordering while preserving the efficiency of phrase-based decoding. Among these, we consider a novel technique to dynamically shape the reordering search space and effectively capture long-range reordering phenomena. Through an extensive evaluation including diverse translation quality metrics, we show that these solutions can significantly narrow the gap between phrase-based and hierarchical SMT.
Original language: English
Title of host publication: Proceedings of the Eighth Workshop on Statistical Machine Translation
Publisher: Association for Computational Linguistics (ACL)
Pages: 440-451
Number of pages: 12
Publication status: Published - 2013
Externally published: Yes
Event: Workshop on Statistical Machine Translation - Sofia, Bulgaria
Duration: 8-Aug-2013 to 9-Aug-2013
Conference number: 8

Workshop

Workshop: Workshop on Statistical Machine Translation
Country/Territory: Bulgaria
City: Sofia
Period: 08/08/2013 to 09/08/2013
