RuG at GermEval: Detecting Offensive Speech in German Social Media

Xiaoyu Bai, Flavio Merenda, Claudia Zaghi, Tomasso Caselli, Malvina Nissim

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    Abstract

    This paper reports on the systems the RuG Team submitted to the GermEval 2018 - Shared Task on the Identification of Offen- sive Language in German tweets. We sub- mitted three systems to Task 1, targeting the problem as a binary classification task, and only one system for Task 2, address- ing a fine-grained classification of offen- sive tweets in different categories. Prelim- inary evaluation of the systems has been conducted on a fixed validation set from the training data. The best macro-F1 score for Task 1, binary classification, is 75.45, obtained by an ensemble model composed by a Linear SVM, a CNN, and a Logistic Regressor as a meta-classifier. As for Task 2, multi-class classification, we obtained a macro-F1 of 40.75 using a multi-class Linear SVM.
    Original languageEnglish
    Title of host publicationProceedings of the GermEval 2018 Workshop
    EditorsJosef Ruppenhofer, Melanie Siegel, Michael Wiegand
    PublisherAustrian Academy of Sciences
    Pages63-70
    Number of pages8
    Publication statusPublished - 2018
    Event14th Conference on Natural Language Processing. KONVENS 2018 - Vienna, Austria
    Duration: 19-Sept-201821-Sept-2018

    Conference

    Conference14th Conference on Natural Language Processing. KONVENS 2018
    Country/TerritoryAustria
    CityVienna
    Period19/09/201821/09/2018

    Keywords

    • offensive language
    • benchmark

    Fingerprint

    Dive into the research topics of 'RuG at GermEval: Detecting Offensive Speech in German Social Media'. Together they form a unique fingerprint.

    Cite this