A New Dataset for Causality Identification in Argumentative Texts

Khalid Al-Khatib, Michael Völske, Shahbaz Syed, Anh Le, Martin Potthast, Benno Stein

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    70 Downloads (Pure)

    Abstract

    Existing datasets for causality identification in argumentative texts have several limitations, such as the type of input text (e.g., only claims), causality type (e.g., only positive), and the linguistic patterns investigated (e.g., only verb connectives). To resolve these limitations, we build the Webis-Causality-23 dataset, with sophisticated inputs (all units from arguments), a balanced distribution of causality types, and a larger number of linguistic patterns denoting causality. The dataset contains 1485 examples derived by combining the two paradigms of distant supervision and uncertainty sampling to identify diverse, high-quality samples of causality relations, and annotate them in a cost-effective manner.
    Original languageEnglish
    Title of host publicationProceedings of the 24th Annual Meeting of the Special Interest Group on Discourse and Dialogue
    EditorsSvetlana Stoyanchev, Shafiq Joty, David Schlangen, Ondrej Dusek, Casey Kennington, Malihe Alikhani
    PublisherAssociation for Computational Linguistics, ACL Anthology
    Pages349-354
    Number of pages6
    DOIs
    Publication statusPublished - 1-Sept-2023
    Event24th Annual Meeting of the Special Interest Group on Discourse and Dialogue - Prague, Czech Republic
    Duration: 11-Sept-202315-Sept-2023

    Conference

    Conference24th Annual Meeting of the Special Interest Group on Discourse and Dialogue
    Country/TerritoryCzech Republic
    CityPrague
    Period11/09/202315/09/2023

    Fingerprint

    Dive into the research topics of 'A New Dataset for Causality Identification in Argumentative Texts'. Together they form a unique fingerprint.

    Cite this