Large-scale Cross-lingual Language Resources for Referencing and Framing

Piek Vossen*, Filip Ilievski, Marten Postma, Antske Fokkens, Gosse Minnema, Levi Remijnse

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review


    Abstract

    In this article, we lay out the basic ideas and principles of the project Framing Situations in the Dutch Language. We present our first data acquisition results, together with the first data release. We introduce the notion of cross-lingual referential corpora. These corpora consist of texts that refer to exactly the same incidents. This referential grounding allows us to analyze how these incidents are framed in different languages and across different texts. During the project, we will use the automatically generated data to study linguistic framing as a phenomenon and to build framing resources such as lexicons and corpora. We expect to capture larger variation in framing than traditional approaches to building such resources do. Our first data release, which contains structured data about a large number of incidents and reference texts, can be found at http://dutchframenet.nl/data-releases/.
    Original language: English
    Title of host publication: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)
    Place of Publication: Marseille, France
    Publisher: European Language Resources Association (ELRA)
    Pages: 3162-3171
    Number of pages: 10
    Publication status: Published - May-2020
    Event: 12th Language Resources and Evaluation Conference: LREC 2020 - Marseille, France
    Duration: 11-May-2020 to 16-May-2020
    https://lrec2020.lrec-conf.org/en/

    Conference

    Conference: 12th Language Resources and Evaluation Conference
    Country: France
    City: Marseille
    Period: 11/05/2020 to 16/05/2020
