DALC - Dutch Abusive Language Corpus

  • Tommaso Caselli (Creator)
  • Marieke Weultjes (Creator)
  • Arjan Schelhaas (Creator)
  • Folkert Leistra (Creator)
  • Hylke van der Veen (Creator)
  • Menno Robben (Creator)
  • Gerben Timmerman (Creator)
  • Victor Zwart (Creator)
  • Robin van der Noord (Creator)
  • Zhenja Gnezdilov (Creator)
  • Dion Theodoridis (Creator)

Dataset

Description

This repository contains the full text format of DALC, structured as follows:

  • unique numeric id of the message
  • full text message, anonymised
  • annotated data for abusive language
  • annotated data for offensive language

Full description of the dataset and accompanying data statement is available at https://github.com/tommasoc80/DALC
Date made available: 23 Mar 2023
Publisher: DataverseNL
  • DALC: the Dutch Abusive Language Corpus

    Caselli, T., Schelhaas, A., Weultjes, M., Leistra, F., van der Veen, H., Timmerman, G. & Nissim, M., 27 Jul 2021, Proceedings of the 5th Workshop on Online Abuse and Harm. Mostafazadeh Davani, A., Kiela, D., Lambert, M., Vidgen, B., Prabhakaran, V. & Waseem, Z. (eds.). Association for Computational Linguistics (ACL), pp. 54-66 (13 pages).

    Research output: Academic, peer-reviewed

    Open Access
