Revisiting Sparse Retrieval for Few-shot Entity Linking

2Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Entity linking aims to link ambiguous mentions to their corresponding entities in a knowledge base. One of the key challenges comes from insufficient labeled data for specific domains. Although dense retrievers have achieved excellent performance on several benchmarks, their performance decreases significantly when only a limited amount of in-domain labeled data is available. In such few-shot setting, we revisit the sparse retrieval method, and propose an ELECTRA-based keyword extractor to denoise the mention context and construct a better query expression. For training the extractor, we propose a distant supervision method to automatically generate training data based on overlapping tokens between mention contexts and entity descriptions. Experimental results on the ZESHEL dataset demonstrate that the proposed method outperforms state-of-the-art models by a significant margin across all test domains, showing the effectiveness of keyword-enhanced sparse retrieval. Code is available at https://github.com/HITsz-TMG/Sparse-Retrieval-Fewshot-EL.

Cite

CITATION STYLE

APA

Chen, Y., Xu, Z., Hu, B., & Zhang, M. (2023). Revisiting Sparse Retrieval for Few-shot Entity Linking. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 12801–12806). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.emnlp-main.789

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free