EDTC: A Corpus for Discourse-Level Topic Chain Parsing

0Citations
Citations of this article
39Readers
Mendeley users who have this article in their library.

Abstract

Discourse analysis has long been known to be fundamental in natural language processing. In this research, we present our insight on discourse-level topic chain (DTC) parsing which aims at discovering new topics and investigating how these topics evolve over time within an article. To address the lack of data, we contribute a new discourse corpus with DTC-style dependency graphs annotated upon news articles. In particular, we ensure the high reliability of the corpus by utilizing a two-step annotation strategy to build the data and filtering out the annotations with low confidence scores. Based on the annotated corpus, we introduce a simple yet robust system for automatic discourse-level topic chain parsing.

Cite

CITATION STYLE

APA

Zhang, L., Tan, X., Kong, F., & Zhou, G. (2021). EDTC: A Corpus for Discourse-Level Topic Chain Parsing. In Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021 (pp. 1304–1312). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-emnlp.113

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free