Twitter-based Polarised Embeddings for Abusive Language Detection

Leon Graumans, Roy David, Tommaso Caselli

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    7 Citations (Scopus)
    156 Downloads (Pure)

    Abstract

    We present a method to generate polarised word embeddings using controversial topics as search terms in Twitter as proxies for interactions among social media communities that may be liable to use abusive language. We investigate to what extent models trained with these embeddings perform with respect to generic embeddings across four data sets of abusive language, both in the same domain and out of domain, using simple linear classifiers. Our results show that the polarised embeddings are competitive in the same domain data sets, and perform better in out of domain one.
    Original languageEnglish
    Title of host publication2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)
    PublisherIEEE
    Number of pages8
    ISBN (Electronic)978-1-7281-3891-6
    ISBN (Print)978-1-7281-3892-3
    DOIs
    Publication statusPublished - Sept-2019

    Fingerprint

    Dive into the research topics of 'Twitter-based Polarised Embeddings for Abusive Language Detection'. Together they form a unique fingerprint.

    Cite this