Natural Language Processing for Ancient Greek: Design, advantages and challenges of language models

Silvia Stopponi, Nilo Pedrazzini, Saskia Peels-Matthey, Barbara McGillivray, Malvina Nissim

Research output: Contribution to journalArticleAcademicpeer-review

1 Citation (Scopus)
127 Downloads (Pure)

Abstract

Computational methods have produced meaningful and usable results to study word semantics, including semanticchange. These methods, belonging to the field of Natural Language Processing, have recently been applied to ancient languages; inparticular, language modelling has been applied to Ancient Greek, the language on which we focus. In this contribution we explainhow vector representations can be computed from word co-occurrences in a corpus and can be used to locate words in a semantic space,and what kind of semantic information can be extracted from language models. We compare three different kinds of language modelsthat can be used to study Ancient Greek semantics: a count-based model, a word embedding model and a syntactic embedding model;and we show examples of how the quality of their representations can be assessed. We highlight the advantages and potential ofthese methods, especially for the study of semantic change, together with their limitations.
Original languageEnglish
Pages (from-to)414-435
Number of pages22
JournalDiachronica
Volume41
Issue number3
Early online date2-Jul-2024
DOIs
Publication statusPublished - Oct-2024

Keywords

  • Ancient Greek
  • computational
  • semantic change
  • lanaguage modelling
  • Natural language processing
  • word embeddings
  • semantic space

Fingerprint

Dive into the research topics of 'Natural Language Processing for Ancient Greek: Design, advantages and challenges of language models'. Together they form a unique fingerprint.

Cite this