Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection

  • Sporleder C
N/ACitations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

We explore the task of predicting which discourse relation holds between two text spans in which the relation is not signalled by an unambiguous discourse marker. It has been proposed that automatically labelled data, which can be derived from examples in which a discourse relation is unambiguously signalled, could be used to train a machine learner to perform this task reasonably well. However, more recent results suggest that there are problems with this approach, probably due to the fact that the automatically labelled data has particular properties which are not shared by the data to which the classifier is then applied. We investigate how big this problem really is and whether the unrepresentativeness of the automatically labelled data can be overcome by performing automatic example and feature selection."

Cite

CITATION STYLE

APA

Sporleder, C. (2007). Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection. Journal for Language Technology and Computational Linguistics, 22(1), 1–20. https://doi.org/10.21248/jlcl.22.2007.86

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free