Set Learning for Generative Information Extraction

1Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

Recent efforts have endeavored to employ the sequence-to-sequence (Seq2Seq) model in Information Extraction (IE) due to its potential to tackle multiple IE tasks in a unified manner. Under this formalization, multiple structured objects are concatenated as the target sequence in a predefined order. However, structured objects, by their nature, constitute an unordered set. Consequently, this formalization introduces a potential order bias, which can impair model learning. Targeting this issue, this paper proposes a set learning approach that considers multiple permutations of structured objects to optimize set probability approximately. Notably, our approach does not require any modifications to model structures, making it easily integrated into existing generative IE frameworks. Experiments show that our method consistently improves existing frameworks on vast tasks and datasets.

Cite

CITATION STYLE

APA

Li, J., Zhang, Y., Liang, B., Wong, K. F., & Xu, R. (2023). Set Learning for Generative Information Extraction. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 13043–13052). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.emnlp-main.806

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free