Sources, in the form of selected Facebook pages, can be used as indicators of hate-rich content. Polarized distributed representations created over such content prove superior to generic embeddings in the task of hate speech detection. The same content seems to carry a too weak signal to proxy silver labels in a distant supervised setting. However, this signal is stronger than gold labels which come from a different distribution, leading to re-think the process of annotation in the context of highly subjective judgments.
|Titel||Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018)|
|Redacteuren||Tomasso Caselli, Nicole Noviell, Viviana Patti, Paolo Rosso|
|Status||Published - 2018|
|Evenement||EVALITA 2018 - CLIC-It 2018, Turin, Italy|
Duur: 12-dec-2018 → 13-dec-2018
|ISSN van elektronische versie||1613-0073|
|Periode||12/12/2018 → 13/12/2018|