Increasing return on annotation investment: the automatic construction of a Universal Dependency treebank for Dutch

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    12 Citations (Scopus)
    89 Downloads (Pure)

    Abstract

    We present a method for automatically converting the Dutch Lassy Small treebank, a phrasal dependency treebank, to UD. All of the information required to produce accurate UD annotation appears to be available in the underlying annotation. However, we also note that the close connection between POS-tags and dependency labels that is present in UD is missing in the Lassy treebanks. As a consequence, annotation decisions in the Dutch
    data for such phenomena as nominalization
    and clausal complements of prepositions
    seem to differ to some extent from comparable data in English and German. Because the conversion is automatic, we can now also compare three state-of-theart dependency parsers trained on UD Lassy Small with Alpino, a hybrid Dutch parser which produces output that is compatible with the original Lassy annotations.
    Original languageEnglish
    Title of host publicationProceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017)
    Pages19-26
    Number of pages8
    Publication statusPublished - May-2017

    Cite this