Abstract
The weighted kappa coefficient is commonly used to assess agreement between two raters on an ordinal scale. This study is the first to assess the impact of missing data on the value of weighted kappa. In a simulation study, we compared three methods for handling missing data: predictive mean matching, listwise deletion and a weighted version of Gwet’s kappa. We evaluated their performance under three missing data mechanisms, using agreement tables with various numbers of categories and different values of weighted kappa. Predictive mean matching outperformed the other two methods in most simulated cases in terms of root mean squared error and in all cases in terms of bias.
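As a rough illustration of the quantity under study, the sketch below computes Cohen's weighted kappa for two raters and handles missing ratings by listwise deletion, the simplest of the three methods compared. It is not the authors' implementation: the function name `weighted_kappa`, the encoding of categories as integers `0..k-1`, the use of `None` for a missing rating, and the default linear weights are all assumptions made for this example. Predictive mean matching and the weighted version of Gwet's kappa are not shown.

```python
from collections import Counter


def weighted_kappa(rater1, rater2, k, weights="linear"):
    """Cohen's weighted kappa for two raters on ordinal categories 0..k-1.

    Hypothetical helper for illustration only. Subjects for which either
    rating is None are dropped (listwise deletion). Requires k >= 2.
    """
    # Listwise deletion: keep only subjects rated by both raters.
    pairs = [(a, b) for a, b in zip(rater1, rater2)
             if a is not None and b is not None]
    n = len(pairs)
    if n == 0:
        raise ValueError("no complete pairs of ratings")

    # Agreement weights: 1 on the diagonal, decreasing with distance.
    power = 1 if weights == "linear" else 2  # any other value: quadratic

    def w(i, j):
        return 1.0 - (abs(i - j) / (k - 1)) ** power

    cells = Counter(pairs)               # observed agreement table
    rows = Counter(a for a, _ in pairs)  # marginal counts, rater 1
    cols = Counter(b for _, b in pairs)  # marginal counts, rater 2

    # Weighted observed agreement and weighted chance-expected agreement.
    p_obs = sum(w(i, j) * c for (i, j), c in cells.items()) / n
    p_exp = sum(w(i, j) * rows[i] * cols[j]
                for i in range(k) for j in range(k)) / n ** 2

    if p_exp == 1.0:
        raise ValueError("kappa is undefined when expected agreement is 1")
    return (p_obs - p_exp) / (1.0 - p_exp)


if __name__ == "__main__":
    r1 = [0, 1, 2, 2, None, 1]
    r2 = [0, 1, 2, 1, 2, None]
    # Only the first four subjects have both ratings and are used.
    print(weighted_kappa(r1, r2, k=3))
```

Listwise deletion discards every subject with a missing rating, so the estimate rests on fewer observations and can be biased when missingness depends on the ratings themselves.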
Original language | English |
---|---|
Article number | 18 |
Number of pages | 19 |
Journal | Machine Learning & Knowledge Extraction |
Volume | 7 |
Issue number | 1 |
Publication status | Published - Mar-2025 |