Simple Embedding-Based Word Sense Disambiguation

Dieke Oele, Gertjan van Noord

Onderzoeksoutput: Conference contributionAcademicpeer review

43 Downloads (Pure)

Samenvatting

We present a simple knowledge-based WSD method that uses word and sense embeddings to compute the similarity between the gloss of a sense and the context of the word. Our method is inspired by the Lesk algorithm as it exploits both the context of the words and the definitions of the senses. It only requires large unlabeled corpora and a sense inventory such as WordNet, and therefore does not rely on annotated data. We explore whether additional extensions to Lesk are compatible with our method. The results of our experiments show that by lexically extending the amount of words in the gloss and context, although it works well for other implementations of Lesk, harms our method. Using a lexical selection method on the context words, on the other hand, improves it. The combination of our method with lexical selection enables our method to outperform state-of the art knowledge-based systems.
Originele taal-2English
TitelProceedings of the 9th Global Wordnet Conference
RedacteurenFrancis Bond, Piek Vossen, Christiane Fellbaum
UitgeverijAssociation for Computational Linguistics (ACL)
Pagina's259-265
Aantal pagina's7
StatusPublished - 2018

Vingerafdruk

Duik in de onderzoeksthema's van 'Simple Embedding-Based Word Sense Disambiguation'. Samen vormen ze een unieke vingerafdruk.

Citeer dit