Abstract
We present a method to generate polarised word embeddings using controversial topics as search terms in Twitter as proxies for interactions among social media communities that may be liable to use abusive language. We investigate to what extent models trained with these embeddings perform with respect to generic embeddings across four data sets of abusive language, both in the same domain and out of domain, using simple linear classifiers. Our results show that the polarised embeddings are competitive in the same domain data sets, and perform better in out of domain one.
Original language | English |
---|---|
Title of host publication | 2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW) |
Publisher | IEEE |
Number of pages | 8 |
ISBN (Electronic) | 978-1-7281-3891-6 |
ISBN (Print) | 978-1-7281-3892-3 |
DOIs | |
Publication status | Published - Sept-2019 |