SumDoCs: Surrounding-aware unsupervised multi-document summarization

4Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Multi-document summarization, which summarizes a set of documents with a small number of phrases or sentences, provides a concise and critical essence of the documents. Existing multi-document summarization methods ignore the fact that there often exist many relevant documents that provide surrounding background knowledge, which can help generate a salient and discriminative summary for a given set of documents. In this paper, we propose a novel method, SUMDocS (Surrounding-aware Unsupervised Multi-Document Summarization), which incorporates rich surrounding (topically related) documents to help improve the quality of extractive summarization without human supervision. Specifically, we propose a joint optimization algorithm to unify global novelty (i.e., category-level frequent and discriminative), local consistency (i.e., locally frequent, co-occurring), and local saliency (i.e., salient from its surroundings) such that the obtained summary captures the characteristics of the target documents. Extensive experiments on news and scientific domains demonstrate the superior performance of our method when the unlabeled surrounding corpus is utilized.

Cite

CITATION STYLE

APA

Zhu, Q., Guo, F., Tian, J., Mao, Y., & Han, J. (2021). SumDoCs: Surrounding-aware unsupervised multi-document summarization. In SIAM International Conference on Data Mining, SDM 2021 (pp. 477–485). Siam Society. https://doi.org/10.1137/1.9781611976700.54

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free