Predicting citations in Dutch case law with natural language processing

Iris Schepers*, Masha Medvedeva, Michelle Bruijn, Martijn Wieling, Michel Vols

*Corresponding author for this work

Research output: Article, Academic, peer-reviewed

Abstract

With the ever-growing accessibility of case law online, it has become challenging to manually identify case law relevant to one’s legal issue. In the Netherlands, the planned increase in the online publication of case law is expected to exacerbate this challenge. In this paper, we tried to predict whether court decisions are cited by other courts after being published, thereby distinguishing, in a way, between more and less authoritative cases. This type of system may be used to process the large amounts of available data by filtering out large quantities of non-authoritative decisions, thus helping legal practitioners and scholars to find relevant decisions more easily and drastically reducing the time spent on preparation and analysis. For the Dutch Supreme Court, the match between our prediction and the actual data was relatively strong (with a Matthews Correlation Coefficient, MCC, of 0.60). Our results were less successful for the Council of State and the district courts (MCC scores of 0.26 and 0.17, respectively). We also attempted to identify the most informative characteristics of a decision. We found that a completely explainable model, consisting only of handcrafted metadata features, performs almost as well as a less explainable system based on the full text of the decision.
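For readers unfamiliar with the metric reported above, the following is a minimal sketch of how a Matthews Correlation Coefficient can be computed for a binary "cited / not cited" prediction. It is not the authors' code; the function name `mcc_from_counts` and the confusion-matrix counts in the usage lines are hypothetical and chosen only for illustration.

```python
import math


def mcc_from_counts(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews Correlation Coefficient from binary confusion-matrix counts.

    tp: decisions predicted as cited that were cited
    tn: decisions predicted as not cited that were not cited
    fp: decisions predicted as cited that were not cited
    fn: decisions predicted as not cited that were cited
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Common convention: return 0.0 when any marginal total is zero,
    # because the coefficient is undefined in that case.
    if denominator == 0:
        return 0.0
    return numerator / denominator


# Hypothetical counts, not taken from the paper.
score = mcc_from_counts(tp=80, tn=70, fp=30, fn=20)
print(f"MCC = {score:.2f}")
```

The score ranges from -1 (total disagreement) through 0 (no better than chance) to 1 (perfect prediction), which is why it is often preferred over accuracy when the two classes are imbalanced.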
Original language: English
Pages (from-to): 807-837
Number of pages: 31
Journal: Artificial Intelligence and Law
Volume: 32
Early online date: 28 Jun 2023
DOIs
Status: Published - Sep 2024
