Distributed Marker Representation for Ambiguous Discourse Markers and Entangled Relations

2Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

Discourse analysis is an important task because it models intrinsic semantic structures between sentences in a document. Discourse markers are natural representations of discourse in our daily language. One challenge is that the markers as well as pre-defined and human-labeled discourse relations can be ambiguous when describing the semantics between sentences. We believe that a better approach is to use a contextual-dependent distribution over the markers to express discourse information. In this work, we propose to learn a Distributed Marker Representation (DMR) by utilizing the (potentially) unlimited discourse marker data with a latent discourse sense, thereby bridging markers with sentence pairs. Such representations can be learned automatically from data without supervision, and in turn provide insights into the data itself. Experiments show the SOTA performance of our DMR on the implicit discourse relation recognition task and strong interpretability. Our method also offers a valuable tool to understand complex ambiguity and entanglement among discourse markers and manually defined discourse relations.

Cite

CITATION STYLE

APA

Ru, D., Qiu, L., Qiu, X., Zhang, Y., & Zhang, Z. (2023). Distributed Marker Representation for Ambiguous Discourse Markers and Entangled Relations. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 5334–5351). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.292

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free