A Noisy-Channel Model for Document Compression

Abstract

We present a document compression system that uses a hierarchical noisy-channel model of text production. The system first automatically derives the syntactic structure of each sentence and the overall discourse structure of the input text. It then uses a statistical hierarchical model of text production to drop unimportant syntactic and discourse constituents, generating coherent, grammatical document compressions of arbitrary length. The system outperforms both a baseline and a sentence-based compression system that operates by sequentially simplifying every sentence in a text. Our results support the claim that discourse knowledge plays an important role in document summarization.
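
As a rough illustration of the noisy-channel framing named in the abstract, the generic decoding rule can be written as follows. The symbols are assumptions introduced here and are not taken from the paper: d is the observed long document, c is a candidate compression, P(c) is a source model that favors coherent, grammatical compressions, and P(d | c) is a channel model that scores how plausibly d arises from c by adding optional syntactic and discourse constituents. The paper's actual parameterization may differ.

```latex
\hat{c} \;=\; \arg\max_{c} P(c \mid d)
        \;=\; \arg\max_{c} \frac{P(c)\, P(d \mid c)}{P(d)}
        \;=\; \arg\max_{c} P(c)\, P(d \mid c)
```

The last step drops P(d) because it is constant across all candidate compressions of the same input document.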

Daumé, H., & Marcu, D. (2002). A Noisy-Channel Model for Document Compression. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2002-July, pp. 449–456). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1073083.1073159
