Co-clustering sentences and terms for multi-document summarization

Yunqing Xia; Yonggang Zhang; Jianmin Yao

Conference Proceedings

Co-clustering sentences and terms for multi-document summarization

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6609 LNCS(PART 2) 339-352

DOI: 10.1007/978-3-642-19437-5_28

6Citations

10Readers

Get full text

Abstract

Two issues are crucial to multi-document summarization: diversity and redundancy. Content within some topically-related articles are usually redundant while the topic is delivered from diverse perspectives. This paper presents a co-clustering based multi-document summarization method that makes full use of the diverse and redundant content. A multi-document summary is generated in three steps. First, the sentence-term co-occurrence matrix is designed to reflect diversity and redundancy. Second, the co-clustering algorithm is performed on the matrix to find globally optimal clusters for sentences and terms in an iterative manner. Third, a more accurate summary is generated by selecting representative sentences from the optimal clusters. Experiments on DUC2004 dataset show that the co-clustering based multi-document summarization method is promising. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Xia, Y., Zhang, Y., & Yao, J. (2011). Co-clustering sentences and terms for multi-document summarization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6609 LNCS, pp. 339–352). https://doi.org/10.1007/978-3-642-19437-5_28

Co-clustering sentences and terms for multi-document summarization

Abstract

Author supplied keywords

Cite

Register to see more suggestions