Abstract
The PageRank model has been successfully exploited for multi-document summarization by making use of the link relationships between sentences in the document set, under the assumption that all the sentences are indistinguishable from each other. However, different documents in the set are usually not equally important, and the sentences in an important document are deemed more salient than the sentences in a trivial document. This paper proposes the document-based HITS model (DocHITS) to fully leverage the document-level information by considering documents and sentences as hubs and authorities. Experimental results on the DUC2001 and DUC2002 datasets demonstrate the good effectiveness of our proposed model. © 2008 Springer Berlin Heidelberg.
Cite
CITATION STYLE
Wan, X. (2008). Document-based HITS model for multi-document summarization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5351 LNAI, pp. 454–465). https://doi.org/10.1007/978-3-540-89197-0_42
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.