RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization

An, Chenxin; Zhong, Ming; Geng, Zhichao; Yang, Jianqiang; Qiu, Xipeng

Computer Science > Computation and Language

arXiv:2109.07943 (cs)

[Submitted on 16 Sep 2021 (v1), last revised 13 Dec 2021 (this version, v2)]

Title:RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization

Authors:Chenxin An, Ming Zhong, Zhichao Geng, Jianqiang Yang, Xipeng Qiu

View PDF

Abstract:Existing summarization systems mostly generate summaries purely relying on the content of the source document. However, even for humans, we usually need some references or exemplars to help us fully understand the source document and write summaries in a particular format. But how to find the high-quality exemplars and incorporate them into summarization systems is still challenging and worth exploring. In this paper, we propose RetrievalSum, a novel retrieval enhanced abstractive summarization framework consisting of a dense Retriever and a Summarizer. At first, several closely related exemplars are retrieved as supplementary input to help the generation model understand the text more comprehensively. Furthermore, retrieved exemplars can also play a role in guiding the model to capture the writing style of a specific corpus. We validate our method on a wide range of summarization datasets across multiple domains and two backbone models: BERT and BART. Results show that our framework obtains significant improvement by 1.38~4.66 in ROUGE-1 score when compared with the powerful pre-trained models, and achieve new state-of-the-art on BillSum. Human evaluation demonstrates that our retrieval enhanced model can better capture the domain-specific writing style.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.07943 [cs.CL]
	(or arXiv:2109.07943v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.07943

Submission history

From: Chenxin An [view email]
[v1] Thu, 16 Sep 2021 12:52:48 UTC (4,713 KB)
[v2] Mon, 13 Dec 2021 12:57:06 UTC (4,736 KB)

Computer Science > Computation and Language

Title:RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators