Extracting and classifying exceptional COVID-19 measures from multilingual legal texts: The merits and limitations of automated approaches

Clara Egger*, Tommaso Caselli, Georgios Tziafas, Eugénie de Saint Phalle, Wietse de Vries

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

1 Citation (Scopus)
60 Downloads (Pure)

Abstract

This paper contributes to ongoing scholarly debates on the merits and limitations of computational legal text analysis by reflecting on the results of a research project documenting exceptional COVID-19 management measures in Europe. The variety of exceptional measures adopted in countries characterized by different legal systems and natural languages, as well as the rapid evolution of such measures, pose considerable challenges to manual textual analysis methods traditionally used in the social sciences. To address these challenges, we develop a supervised classifier to support the manual coding of exceptional policies by a multinational team of human coders. After presenting the results of various natural language processing (NLP) experiments, we show that human-in-the-loop approaches to computational text analysis outperform unsupervised approaches in accurately extracting policy events from legal texts. We draw lessons from our experience to ensure the successful integration of NLP methods into social science research agendas.

Original languageEnglish
Pages (from-to)704-723
Number of pages20
JournalRegulation and Governance
Volume18
Issue number3
Early online date2-Oct-2023
DOIs
Publication statusPublished - Jul-2024

Keywords

  • computational text analysis
  • COVID-19
  • crisis governance
  • law-as-data
  • natural language processing

Fingerprint

Dive into the research topics of 'Extracting and classifying exceptional COVID-19 measures from multilingual legal texts: The merits and limitations of automated approaches'. Together they form a unique fingerprint.

Cite this