Comparison of methods for explicit discourse connective identification across various domains

Merel C.J. Scholman; Tianai Dong; Frances Yung; Vera Demberg

Conference Proceedings

Comparison of methods for explicit discourse connective identification across various domains

2nd Workshop on Computational Approaches to Discourse, CODI 2021 - Proceedings of the Workshop (2021) 95-106

DOI: 10.18653/v1/2021.codi-main.9

11Citations

48Readers

Get full text

Abstract

Existing parse methods use varying approaches to identify explicit discourse connectives, but their performance has not been consistently evaluated in comparison to each other, nor have they been evaluated consistently on text other than newspaper articles. We here assess the performance on explicit connective identification of four parse methods (PDTB e2e, Lin et al., 2014; the winner of CONLL2015, Wang and Lan, 2015; DisSent, Nie et al., 2019; and Discopy, Knaebel and Stede, 2020), along with a simple heuristic. We also examine how well these systems generalize to different datasets, namely newspaper text (PDTB), scientific text (BioDRB), prepared spoken text (TED-MDB) and spontaneous spoken text (Disco-SPICE). The results show that Discopy outperforms the other parse methods in all datasets, with the exception of DiscoSPICE. Moreover, performance drops significantly from the PDTB to all other datasets. We provide a more fine-grained analysis of domain differences and connectives that prove difficult to parse, in order to highlight the areas where gains can be made.

Cite

CITATION STYLE

APA

Scholman, M. C. J., Dong, T., Yung, F., & Demberg, V. (2021). Comparison of methods for explicit discourse connective identification across various domains. In 2nd Workshop on Computational Approaches to Discourse, CODI 2021 - Proceedings of the Workshop (pp. 95–106). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.codi-main.9

Comparison of methods for explicit discourse connective identification across various domains

Abstract

Cite

Register to see more suggestions