Coreference Resolution through a seq2seq Transition-Based System

44Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.

Abstract

Most recent coreference resolution systems use search algorithms over possible spans to identify mentions and resolve coreference. We instead present a coreference resolution system that uses a text-to-text (seq2seq) paradigm to predict mentions and links jointly. We im-plement the coreference system as a transition system and use multilingual T5 as an underly-ing language model. We obtain state-of-the-art accuracy on the CoNLL-2012 datasets with 83.3 F1-score for English (a 2.3 higher F1-score than previous work [Dobrovolskii, 2021]) using only CoNLL data for training, 68.5 F1-score for Arabic (+4.1 higher than previous work), and 74.3 F1-score for Chinese (+5.3). In addition we use the SemEval-2010 data sets for experiments in the zero-shot set-ting, a few-shot setting, and supervised setting using all available training data. We obtain substantially higher zero-shot F1-scores for 3 out of 4 languages than previous approaches and significantly exceed previous supervised state-of-the-art results for all five tested lan-guages. We provide the code and models as open source.1.

Cite

CITATION STYLE

APA

Bohnet, B., Alberti, C., & Collins, M. (2023). Coreference Resolution through a seq2seq Transition-Based System. Transactions of the Association for Computational Linguistics, 11, 212–226. https://doi.org/10.1162/tacl_a_00543

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free