Sound Change Estimation in Netherlandic Regional Languages: Reducing Inter-Transcriber Variability in Dialect Corpora

Raoul Sergio Samuel Jan Buurke*, Martijn Wieling

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

38 Downloads (Pure)

Abstract

Large phonetic corpora are frequently used to investigate language variation and change in dialects, but these corpora are often constructed by many researchers in a collaborative effort. This typically results in inter-transcriber issues that may impact the reliability of analyses using these data. This problem is exacerbated when multiple phonetic corpora are compared when investigating real time dialect change. In this study, we therefore propose a method to automatically and iteratively merge phonetic symbols used in the transcriptions to obtain a more coarse-grained, but better comparable, phonetic transcription. Our approach is evaluated using two large phonetic Netherlandic dialect corpora in an attempt to estimate sound change in the area in the 20th century. The results are discussed in the context of the available literature about dialect change in the Netherlandic area.
Original languageEnglish
Pages (from-to)7-28
Number of pages22
JournalTaal en Tongval
Volume75
Issue number1
DOIs
Publication statusPublished - 1-Sept-2023

Fingerprint

Dive into the research topics of 'Sound Change Estimation in Netherlandic Regional Languages: Reducing Inter-Transcriber Variability in Dialect Corpora'. Together they form a unique fingerprint.

Cite this