Samenvatting
Scarcity of parallel data causes formality style transfer models to have scarce success in preserving content. We show that fine-tuning pre-trained language (GPT-2) and sequence-to-sequence (BART) models boosts content preservation, and that this is possible even with limited amounts of parallel data. Augmenting these models with rewards that target style and content –the two core aspects of the task– we achieve a new state-of-the-art.
Originele taal-2 | English |
---|---|
Titel | Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing |
Redacteuren | Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli |
Plaats van productie | Bangkok, Thailand |
Uitgeverij | Association for Computational Linguistics, ACL Anthology |
Pagina's | 484-494 |
Aantal pagina's | 11 |
Volume | 2 |
DOI's | |
Status | Published - 2021 |