Abstract
We analyze the effect of further retraining BERT with different domain specific data as an unsupervised domain adaptation strategy for event extraction. Portability of event extraction models is particularly challenging, with large performance drops affecting data on the same text genres (eg, news). We present PROTEST-ER, a retrained BERT model for protest event extraction. PROTEST-ER outperforms a corresponding generic BERT on out-of-domain data of 8.1 points. Our best performing models reach 51.91-46.39 F1 across both domains.
Original language | English |
---|---|
Title of host publication | Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021) |
Editors | Ali Hürriyetoğlu |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 12-19 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 2021 |