Activities per year
Abstract
With the ever-growing accessibility of case law online, it has become challenging
to manually identify case law relevant to one’s legal issue. In the Netherlands, the planned increase in the online publication of case law is expected to exacerbate this challenge. In this paper, we tried to predict whether court decisions are cited by other courts or not after being published, thus in a way distinguishing between more and less authoritative cases. This type of system may be used to process the large amounts of available data by filtering out large quantities of non-authoritative decisions, thus helping legal practitioners and scholars to find relevant decisions more easily, and drastically reducing the time spent on preparation and analysis. For the Dutch Supreme Court, the match between our prediction and the actual data was relatively strong (with a Matthews Correlation Coefficient of 0.60). Our results were less successful for the Council of State and the district courts (MCC scores of 0.26 and 0.17, relatively). We also attempted to identify the most informative characteristics of a decision. We found that a completely explainable model, consisting only of handcrafted metadata features, performs almost as well as a less well-explainable system based on all text of the decision.
to manually identify case law relevant to one’s legal issue. In the Netherlands, the planned increase in the online publication of case law is expected to exacerbate this challenge. In this paper, we tried to predict whether court decisions are cited by other courts or not after being published, thus in a way distinguishing between more and less authoritative cases. This type of system may be used to process the large amounts of available data by filtering out large quantities of non-authoritative decisions, thus helping legal practitioners and scholars to find relevant decisions more easily, and drastically reducing the time spent on preparation and analysis. For the Dutch Supreme Court, the match between our prediction and the actual data was relatively strong (with a Matthews Correlation Coefficient of 0.60). Our results were less successful for the Council of State and the district courts (MCC scores of 0.26 and 0.17, relatively). We also attempted to identify the most informative characteristics of a decision. We found that a completely explainable model, consisting only of handcrafted metadata features, performs almost as well as a less well-explainable system based on all text of the decision.
Original language | English |
---|---|
Pages (from-to) | 807-837 |
Number of pages | 31 |
Journal | Artificial Intelligence and Law |
Volume | 32 |
Early online date | 28-Jun-2023 |
DOIs | |
Publication status | Published - Sept-2024 |
Keywords
- machine learning
- case law
- natural language processing
- citation analysis
- judicial decisions
Fingerprint
Dive into the research topics of 'Predicting citations in Dutch case law with natural language processing'. Together they form a unique fingerprint.Activities
- 1 Academic presentation
-
Predicting citations in Dutch case law with natural language processing
Schepers, I. (Speaker)
2-Mar-2022Activity: Talk and presentation › Academic presentation › Academic
Projects
- 1 Active
-
EVICT : The Impact of the International Right to Housing on National Legal Discourse: Using Data Science Techniques to Analyse Eviction Litigation
Vols, M. (PI), Wieling, M. (Researcher), Bruijn, M. (Postdoc), Hoops, B. (CoPI), Roorda, B. (Member), Mohammadi, M. (Postdoc), Quintiá Pastrana, A. (Postdoc), Whitehouse, L. (Advisor), Schmid, C. (Advisor), van Dijck, G. (Advisor), Arts, K. (Advisor), Prakken, H. (Advisor), de Vries, J. (Advisor) & Zauner-Lohmeyer, K. (Advisor)
01/01/2021 → …
Project: Research
File