Discourse analysis has long been known to be fundamental in natural language processing. In this research, we present our insight on discourse-level topic chain (DTC) parsing which aims at discovering new topics and investigating how these topics evolve over time within an article. To address the lack of data, we contribute a new discourse corpus with DTC-style dependency graphs annotated upon news articles. In particular, we ensure the high reliability of the corpus by utilizing a two-step annotation strategy to build the data and filtering out the annotations with low confidence scores. Based on the annotated corpus, we introduce a simple yet robust system for automatic discourse-level topic chain parsing.
CITATION STYLE
Zhang, L., Tan, X., Kong, F., & Zhou, G. (2021). EDTC: A Corpus for Discourse-Level Topic Chain Parsing. In Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021 (pp. 1304–1312). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-emnlp.113
Mendeley helps you to discover research relevant for your work.