Using linguistically motivated features for paragraph boundary identification

Katja Filippova; Michael Strube

Conference Proceedings

Using linguistically motivated features for paragraph boundary identification

COLING/ACL 2006 - EMNLP 2006: 2006 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (2006) 267-274

DOI: 10.3115/1610075.1610114

15Citations

88Readers

Get full text

Abstract

In this paper we propose a machine-learning approach to paragraph boundary identification which utilizes linguistically motivated features. We investigate the relation between paragraph boundaries and discourse cues, pronominalization and information structure. We test our algorithm on German data and report improvements over three baselines including a reimplementation of Sporleder & Lapata's (2006) work on paragraph segmentation. An analysis of the features' contribution suggests an interpretation of what paragraph boundaries indicate and what they depend on. © 2006 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Filippova, K., & Strube, M. (2006). Using linguistically motivated features for paragraph boundary identification. In COLING/ACL 2006 - EMNLP 2006: 2006 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 267–274). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1610075.1610114

Using linguistically motivated features for paragraph boundary identification

Abstract

Cite

Register to see more suggestions