Interpreting Neural Language Models for Linguistic Complexity Assessment


100 Downloads (Pure)


The study of linguistic complexity is a deeply multidisciplinary area, ranging from the study of cognitive processing in human readers to the classification of structural complexity characterizing expressions in natural language. In recent times, the use of computational methods for the processing and analysis of language has produced important developments in the understanding of multiple phenomena associated with linguistic complexity. In line with the state of the art in the sector, this thesis presents a model-driven study of multiple phenomena associated with linguistic complexity. First, relationships subsisting between various extrinsic complexity metrics - perception of linguistic complexity, readability, cognitive processing and predictability - are empirically explored, highlighting similarities and differences from a linguistically and cognitively motivated perspective. Then, it is studied how the information underlying the different complexity metrics can be acquired by language models based on neural networks, at various levels of abstraction and granularity, using interpretability techniques derived from the literature on natural language processing. In conclusion, the ability of various computational models of complexity in predicting cognitive processing difficulties associated with atypical syntactic constructs, such as garden-path sentences, is evaluated. The experimental results of this study provide convergent evidence regarding the limited abstraction and generalization abilities of state-of-the-art neural language models for the prediction of linguistic complexity, and encourage the adoption of lines of research that integrate symbolic and interpretable information in this sector. Code is available at
Originele taal-2English
KwalificatieMaster of Science
Toekennende instantie
  • University of Trieste
  • Dell'Orletta, Felice, Supervisor, Externe Persoon
  • Crepaldi, Davide, Supervisor, Externe Persoon
Datum van toekenning11-dec.-2020
StatusPublished - 11-dec.-2020
Extern gepubliceerdJa

Citeer dit