The diversity of coreference chains is usually tackled by means of global features (length, types and number of referring expressions, distance between them, etc.). In this paper, we propose a novel approach that provides a description of their composition in terms of sequences of expressions. To this end, we apply sequence analysis techniques to bring out the various strategies for introducing a referent and keeping it active throughout discourse. We discuss a first application of this method to a French written corpus annotated with coreference chains. We obtain clusters that are linguistically coherent and interpretable in terms of reference strategies and we demonstrate the influence of text genre and semantic type of the referent on chain composition.
CITATION STYLE
Federzoni, S., Ho-Dac, L. M., & Fabre, C. (2021). Coreference Chains Categorization by Sequence Clustering. In 2nd Workshop on Computational Approaches to Discourse, CODI 2021 - Proceedings of the Workshop (pp. 52–57). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.codi-main.5
Mendeley helps you to discover research relevant for your work.