NCC: Neural concept compression for multilingual document recommendation[Formula presented]

Tsegaye Misikir Tashu*, Marc Lenz, Tomáš Horváth

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

1 Citation (Scopus)
111 Downloads (Pure)

Abstract

In this work, we propose a novel method for generating inter-lingual document representations using neural network concept compression. The presented approach is intended to improve the quality of content-based multilingual document recommendation and information retrieval systems by creating a language-independent representation. The main idea is to use mappings to align monolingual representation spaces, using concept compression, to create inter-lingual representations. The proposed approach outperforms traditional cross-lingual retrieval and recommendations methods in experiments conducted on JRC-Acquis and EU bookshop multilingual corpora. Our dataset and code are publicly available at https://github.com/Tsegaye-misikir/NCC.

Original languageEnglish
Article number110348
Number of pages10
JournalApplied Soft Computing
Volume142
DOIs
Publication statusPublished - Jul-2023

Keywords

  • Cross-lingual representation
  • Document representation
  • Information retrieval
  • Multi-lingual recommendation
  • Natural language processing

Fingerprint

Dive into the research topics of 'NCC: Neural concept compression for multilingual document recommendation[Formula presented]'. Together they form a unique fingerprint.

Cite this