Reassessing Claims of Human Parity and Super-Human Performance in Machine Translation at WMT 2019

Antonio Toral*

*Corresponding author voor dit werk

    OnderzoeksoutputAcademicpeer review

    29 Citaten (Scopus)
    49 Downloads (Pure)

    Samenvatting

    We reassess the claims of human parity and super-human performance made at the news shared task of WMT2019 for three translation directions: English > German, English > Russian and German >English. First we identify three potential issues in the human evaluation of that shared task: (i) the limited amount of intersentential context available, (ii) the limited translation proficiency of the evaluators and (iii) the use of a reference translation. We then conduct a modified evaluation taking these issues into account. Our results indicate that all the claims of human parity and super-human performance made at WMT2019 should be refuted, except the claim of human parity for English > German. Based on our findings, we put forward a set of recommendations and open questions for future assessments of human parity in machine translation.
    Originele taal-2English
    TitelProceedings of the 22nd Annual Conference of the European Association for Machine Translation
    Plaats van productieLisboa, Portugal
    UitgeverijEuropean Association for Machine Translation
    Pagina's185-194
    Aantal pagina's10
    StatusPublished - 2020
    EvenementAnnual Conference of the European Association for Machine Translation - Online, Portugal
    Duur: 3-nov.-20205-nov.-2020
    Congresnummer: 22

    Conference

    ConferenceAnnual Conference of the European Association for Machine Translation
    Land/RegioPortugal
    Periode03/11/202005/11/2020

    Vingerafdruk

    Duik in de onderzoeksthema's van 'Reassessing Claims of Human Parity and Super-Human Performance in Machine Translation at WMT 2019'. Samen vormen ze een unieke vingerafdruk.

    Citeer dit