Abstract
Current approaches to automatic summarization of scientific papers generate informative summaries in the form of abstracts. However, abstracts are not intended to show the relationship between a paper and the references cited in it. We propose a new contextualized summarization approach that can generate an informative summary conditioned on a given sentence containing the citation of a reference (a so-called “citance”). This summary outlines the content of the cited paper relevant to the citation location. Thus, our approach extracts and models the citances of a paper, retrieves relevant passages from cited papers, and generates abstractive summaries tailored to each citance. We evaluate our approach using WEBIS-CONTEXT-SCISUMM-2023, a new dataset containing 540K computer science papers and 4.6M citances therein.
Original language | English |
---|---|
Title of host publication | Findings of the Association for Computational Linguistics |
Subtitle of host publication | EMNLP 2023 |
Editors | Houda Bouamor, Juan Pino, Kalika Bali |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 8551-8568 |
Number of pages | 18 |
ISBN (Electronic) | 9798891760615 |
DOIs | |
Publication status | Published - 2023 |
Event | 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) - Singapore, Singapore Duration: 6-Dec-2023 → 10-Dec-2023 |
Conference
Conference | 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 06/12/2023 → 10/12/2023 |