Simple Embedding-Based Word Sense Disambiguation

Dieke Oele, Gertjan van Noord

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

43 Downloads (Pure)

Abstract

We present a simple knowledge-based WSD method that uses word and sense embeddings to compute the similarity between the gloss of a sense and the context of the word. Our method is inspired by the Lesk algorithm as it exploits both the context of the words and the definitions of the senses. It only requires large unlabeled corpora and a sense inventory such as WordNet, and therefore does not rely on annotated data. We explore whether additional extensions to Lesk are compatible with our method. The results of our experiments show that by lexically extending the amount of words in the gloss and context, although it works well for other implementations of Lesk, harms our method. Using a lexical selection method on the context words, on the other hand, improves it. The combination of our method with lexical selection enables our method to outperform state-of the art knowledge-based systems.
Original languageEnglish
Title of host publicationProceedings of the 9th Global Wordnet Conference
EditorsFrancis Bond, Piek Vossen, Christiane Fellbaum
PublisherAssociation for Computational Linguistics (ACL)
Pages259-265
Number of pages7
Publication statusPublished - 2018

Fingerprint

Dive into the research topics of 'Simple Embedding-Based Word Sense Disambiguation'. Together they form a unique fingerprint.

Cite this