Putting Dutchcoref to the Test: Character Detection and Gender Dynamics in Contemporary Dutch Novels

Joris van Zundert, Andreas van Cranenburgh, Roel Smeets

OnderzoeksoutputAcademicpeer review

75 Downloads (Pure)

Samenvatting

Although coreference resolution is a necessary step for a wide range of automated narratological analyses, most of the systems performing this task leave much to be desired in terms of either accuracy or their practical application in literary studies. While there are coreference resolution systems that demonstrate good performance on annotated fragments of novels, evaluations typically do not consider performance on the full texts of novels. In order to optimize its output for concrete use in Dutch literary studies, we are in the process of evaluating and finetuning Dutchcoref. Dutchcoref is an implementation of the Stanford Multi-Pass Sieve Coreference System for Dutch. Using a “silver standard” of annotated data on 2,137 characters in 170 contemporary Dutch novels, we assess the extent to which Dutchcoref is
able to identify the most prominent characters and their gender. Furthermore, we explore the usability of the system by exploring a specific narratological question about the gender distribution of the characters. We find that Dutchcoref is highly accurate in detecting noun phrases, proper names, and pronouns referring to characters, and that it is accurate in establishing their gender. However, the ability to cluster co-references together in a character profile, which we compare to BookNLP’s performance in this respect, is still sub-optimal and deteriorates with text length. We show that, notwithstanding current state of development, Dutchcoref can be applied for meaningful literary analysis, and we outline future
prospects.
Originele taal-2English
TitelProceedings of the Computational Humanities Research conference 2023
RedacteurenArtjoms Šeļa, Fotis Jannidis, Iza Romanowska
Plaats van productieParis, France
UitgeverijCEUR Workshop Proceedings (CEUR-WS.org)
Pagina's757-771
Aantal pagina's15
StatusPublished - 2023
EvenementComputational Humanities Research Conference - Paris, France
Duur: 6-dec.-20238-dec.-2023

Conference

ConferenceComputational Humanities Research Conference
Land/RegioFrance
StadParis
Periode06/12/202308/12/2023

Vingerafdruk

Duik in de onderzoeksthema's van 'Putting Dutchcoref to the Test: Character Detection and Gender Dynamics in Contemporary Dutch Novels'. Samen vormen ze een unieke vingerafdruk.

Citeer dit