Abstract
Natural languages possess a wealth of indefinite forms that typically differ in distribution and interpretation. Although formal semanticists have strived to develop precise meaning representations for different indefinite functions, to date there has hardly been any corpus work on the topic. In this paper, we present the results of a small corpus study where English indefinite forms any and some were labelled with fine-grained semantic functions well-motivated by typological studies. We developed annotation guidelines that could be used by non-expert annotators and calculated inter-annotator agreement amongst several coders. The results show that the annotation task is hard, with agreement scores ranging from 52% to 62% depending on the number of functions considered, but also that each of the independent annotations is in accordance with theoretical predictions regarding the possible distributions
of indefinite functions. The resulting annotated corpus is available upon request and can be accessed through a searchable online database
of indefinite functions. The resulting annotated corpus is available upon request and can be accessed through a searchable online database
Original language | English |
---|---|
Title of host publication | The eighth international conference on Language Resources and Evaluation (LREC) |
Place of Publication | Istanbul, Turkey |
Publisher | European Language Resources Association (ELRA) |
Pages | 1511-1515 |
Number of pages | 5 |
Publication status | Published - May-2012 |
Externally published | Yes |