We explore the task of predicting which discourse relation holds between two text spans in which the relation is not signalled by an unambiguous discourse marker. It has been proposed that automatically labelled data, which can be derived from examples in which a discourse relation is unambiguously signalled, could be used to train a machine learner to perform this task reasonably well. However, more recent results suggest that there are problems with this approach, probably due to the fact that the automatically labelled data has particular properties which are not shared by the data to which the classifier is then applied. We investigate how big this problem really is and whether the unrepresentativeness of the automatically labelled data can be overcome by performing automatic example and feature selection."
CITATION STYLE
Sporleder, C. (2007). Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection. Journal for Language Technology and Computational Linguistics, 22(1), 1–20. https://doi.org/10.21248/jlcl.22.2007.86
Mendeley helps you to discover research relevant for your work.