Genre distinctions for Discourse in the Penn TreeBank

Bonnie Webber

Conference Proceedings

ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (2009) 674-682

DOI: 10.3115/1690219.1690240

Abstract

Articles in the Penn TreeBank were identified as being reviews, summaries, letters to the editor, news reportage, corrections, wit and short verse, or quarterly profit reports. All but the last three were then characterised in terms of features manually annotated in the Penn Discourse TreeBank: discourse connectives and their senses. Summaries turned out to display very different discourse features from the other three genres. Letters also appeared to have some different features. The two main findings involve (1) differences between genres in the senses associated with intra-sentential discourse connectives, inter-sentential discourse connectives and inter-sentential discourse relations that are not lexically marked; and (2) differences within all four genres between the senses of discourse relations not lexically marked and those that are marked. The first finding means that genre should be made a factor in automated sense labelling of non-lexically marked discourse relations. The second means that lexically marked relations provide a poor model for automated sense labelling of relations that are not lexically marked.

Cite

Webber, B. (2009). Genre distinctions for Discourse in the Penn TreeBank. In ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (pp. 674–682). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1690219.1690240
