An empirical approach to discourse markers by clustering

Laura Alonso; Irene Castellón; Karina Gibert; Lluís Padró

Conference Proceedings

An empirical approach to discourse markers by clustering

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2002) 2504 173-183

DOI: 10.1007/3-540-36079-4_15

10Citations

5Readers

Get full text

Abstract

The problem of capturing discourse structure for complex NLP tasks has often been addressed by exploiting surface clues that can yield a partial structure of discourse. Discourse Markers (DMs) are among the most popular of these clues because they are both highly informative of discourse structure and have a very low processing cost. However, they present two main problems: first, there is a general lack of consensus about their appropriate characterisation for NLP applications, and secondly, their potential as an unexpensive source of discourse knowledge is weakened by the fact that information associated to them is usually hand-encoded. In this paper we will show how a combination of clustering techniques provides empirical evidence for a characterisation of DMs. This data-driven methodology provides generalisations helpful for reducing the cost of encoding the information associated to DMs, while increasing consistency of their characterisation.

Cite

CITATION STYLE

APA

Alonso, L., Castellón, I., Gibert, K., & Padró, L. (2002). An empirical approach to discourse markers by clustering. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2504, pp. 173–183). Springer Verlag. https://doi.org/10.1007/3-540-36079-4_15

An empirical approach to discourse markers by clustering

Abstract

Cite

Register to see more suggestions