This repository contains the full text format of DALC structured as follows: - unique numeric id of the message - full text message anonymised - annotated data for abusive language - annotated data for offensive language Full description of the dataset and accompanying data statement is available at https://github.com/tommasoc80/DALC
Caselli, T., Schelhaas, A., Weultjes, M., Leistra, F., van der Veen, H., Timmerman, G. & Nissim, M., 27-Jul-2021, Proceedings of the 5th Workshop on Online Abuse and Harm. Mostafazadeh Davani, A., Kiela, D., Lambert, M., Vidgen, B., Prabhakaran, V. & Waseem, Z. (eds.). Association for Computational Linguistics (ACL), p. 54-6613 p.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review
Open Access
File
129Downloads
(Pure)
Cite this
DataSetCite
Caselli, T. (Creator), Weultjes, M. (Creator), Schelhaas, A. (Creator), Leistra, F. (Creator), Van Der Veen, H. (Creator), Robben, M. (Creator), Timmerman, G. (Creator), Zwart, V. (Creator), Van Der Noord, R. (Creator), Gnezdilov, Z. (Creator), Theodoridis, D. (Creator) (23-Mar-2023). DALC - Dutch Abusive Language Corpus. DataverseNL. 10.34894/hoinl3