GraphER: Token-centric entity resolution with graph convolutional neural networks

46Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.

Abstract

Entity resolution (ER) aims to identify entity records that refer to the same real-world entity, which is a critical problem in data cleaning and integration. Most of the existing models are attribute-centric, that is, matching entity pairs by comparing similarities of pre-aligned attributes, which require the schemas of records to be identical and are too coarse-grained to capture subtle key information within a single attribute. In this paper, we propose a novel graph-based ER model GraphER. Our model is token-centric: the final matching results are generated by directly aggregating token-level comparison features, in which both the semantic and structural information has been softly embedded into token embeddings by training an Entity Record Graph Convolutional Network (ER-GCN). To the best of our knowledge, our work is the first effort to do token-centric entity resolution with the help of GCN in entity resolution task. Extensive experiments on two real-world datasets demonstrate that our model stably outperforms state-of-the-art models.

Cite

CITATION STYLE

APA

Li, B., Wang, W., Sun, Y., Zhang, L., Ali, M. A., & Wang, Y. (2020). GraphER: Token-centric entity resolution with graph convolutional neural networks. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 8172–8179). AAAI press. https://doi.org/10.1609/aaai.v34i05.6330

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free