Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection

Caroline Sporleder

Journal ArticleOPEN ACCESS

Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection

Sporleder C

Journal for Language Technology and Computational Linguistics (2007) 22(1) 1-20

DOI: 10.21248/jlcl.22.2007.86

N/ACitations

14Readers

Abstract

We explore the task of predicting which discourse relation holds between two text spans in which the relation is not signalled by an unambiguous discourse marker. It has been proposed that automatically labelled data, which can be derived from examples in which a discourse relation is unambiguously signalled, could be used to train a machine learner to perform this task reasonably well. However, more recent results suggest that there are problems with this approach, probably due to the fact that the automatically labelled data has particular properties which are not shared by the data to which the classifier is then applied. We investigate how big this problem really is and whether the unrepresentativeness of the automatically labelled data can be overcome by performing automatic example and feature selection."

Cite

CITATION STYLE

APA

Sporleder, C. (2007). Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection. Journal for Language Technology and Computational Linguistics, 22(1), 1–20. https://doi.org/10.21248/jlcl.22.2007.86

Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection

Abstract

Cite

Register to see more suggestions