SPOD: Syntactic Profiler of Dutch

Gertjan van Noord*, Jack Hoeksema, Peter Kleiweg, Gosse Bouma

*Corresponding author voor dit werk

OnderzoeksoutputAcademicpeer review

154 Downloads (Pure)

Samenvatting

SPOD is a tool for Dutch syntax in which a given corpus is analysed according to a large number of predefined syntactic characteristics. SPOD is an extension of the PaQu (”Parse and Query”) tool (Odijk et al. 2017). SPOD is available for a number of standard Dutch corpora and treebanks.
In addition, you can upload your own texts which will then be syntactically analysed. SPOD will run a potentially large number of syntactic queries in order to show a variety of corpus properties, such as the number of main and subordinate clauses, types of main and subordinate clauses, and their frequencies, average length of clauses (per clause type: e.g. relative clauses, indirect questions, finite complement clauses, infinitival clauses, finite adverbial clauses, etc.). Other syntactic constructions include comparatives, correlatives, various types of verb clusters, separable verb prefixes, depth of embedding etc.
SPOD allows linguists to obtain a quick overview of the syntactic properties of texts, for instance with the goal to find interesting differences between text types, or between authors with different backgrounds or different age. In the paper, we describe the SPOD tool in some more detail, and we provide a case study, illustrating the type of investigations which are enabled and
facilitated by SPOD. Most of the syntactic properties are implemented in SPOD by means of relatively complicated XPath 2.0 queries, and as such SPOD also provides examples of relevant syntactic queries, which may otherwise be relatively hard to define for non-technical linguists.
Originele taal-2English
Pagina's (van-tot)129-145
Aantal pagina's17
TijdschriftComputational Linguistics in the Netherlands Journal
Volume10
Nummer van het tijdschrift1
StatusPublished - dec.-2020

Vingerafdruk

Duik in de onderzoeksthema's van 'SPOD: Syntactic Profiler of Dutch'. Samen vormen ze een unieke vingerafdruk.

Citeer dit