Abstract
We present a method for automatically converting the Dutch Lassy Small treebank, a phrasal dependency treebank, to UD. All of the information required to produce accurate UD annotation appears to be available in the underlying annotation. However, we also note that the close connection between POS-tags and dependency labels that is present in UD is missing in the Lassy treebanks. As a consequence, annotation decisions in the Dutch
data for such phenomena as nominalization
and clausal complements of prepositions
seem to differ to some extent from comparable data in English and German. Because the conversion is automatic, we can now also compare three state-of-theart dependency parsers trained on UD Lassy Small with Alpino, a hybrid Dutch parser which produces output that is compatible with the original Lassy annotations.
data for such phenomena as nominalization
and clausal complements of prepositions
seem to differ to some extent from comparable data in English and German. Because the conversion is automatic, we can now also compare three state-of-theart dependency parsers trained on UD Lassy Small with Alpino, a hybrid Dutch parser which produces output that is compatible with the original Lassy annotations.
Original language | English |
---|---|
Title of host publication | Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017) |
Pages | 19-26 |
Number of pages | 8 |
Publication status | Published - May-2017 |