Abstract
In this paper we explore the variation of sentences as a function of the sentence number. We demonstrate that while the entropy of the sentence increases with the sentence number, it decreases at the paragraph boundaries in accordance with the Entropy Rate Constancy principle (introduced in related work). We also demonstrate that the principle holds for different genres and languages and explore the role of genre informativeness. We investigate potential causes of entropy variation by looking at the tree depth, the branching factor, the size of constituents, and the occurrence of gapping.
Cite
CITATION STYLE
Genzel, D., & Charniak, E. (2003). Variation of Entropy and Parse Trees of Sentences as a Function of the Sentence Number. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, EMNLP 2003 (pp. 65–72). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1119355.1119364
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.