SummPip: Unsupervised Multi-Document Summarization with Sentence Graph Compression

49Citations
Citations of this article
66Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Obtaining training data for multi-document Summarization (MDS) is time consuming and resource-intensive, so recent neural models can only be trained for limited domains. In this paper, we propose SummPip: an unsupervised method for multi-document summarization, in which we convert the original documents to a sentence graph, taking both linguistic and deep representation into account, then apply spectral clustering to obtain multiple clusters of sentences, and finally compress each cluster to generate the final summary. Experiments on Multi-News and DUC-2004 datasets show that our method is competitive to previous unsupervised methods and is even comparable to the neural supervised approaches. In addition, human evaluation shows our system produces consistent and complete summaries compared to human written ones.

Cite

CITATION STYLE

APA

Zhao, J., Liu, M., Gao, L., Jin, Y., Du, L., Zhao, H., … Haffari, G. (2020). SummPip: Unsupervised Multi-Document Summarization with Sentence Graph Compression. In SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1949–1952). Association for Computing Machinery, Inc. https://doi.org/10.1145/3397271.3401327

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free