Heuristic-based configuration learning for linked data instance matching

1Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Instance matching in linked data has become increasingly important because of the rapid development of linked data. The goal of instance matching is to detect co-referent instances that refer to the same real-world objects. In order to realize such instances, instance matching systems use a configuration, which specifies the matching properties, similarity measures, and other settings of the matching process. For different repositories, the configuration is varied to adapt with the particular characteristics of the input. Therefore, the automation of configuration creation is very important. In this paper, we propose cLink, a supervised instance matching system for linked data. cLink is enhanced by a heuristic algorithm that learns the optimal configuration on the basic of input repositories. We show that cLink can achieve effective performance even when being given only a small amount of training data. Compared to previous configuration learning algorithms, our algorithm significantly improves the results. Compared to the recent supervised systems, cLink is also consistently better on all tested datasets.

Cite

CITATION STYLE

APA

Nguyen, K., & Ichise, R. (2016). Heuristic-based configuration learning for linked data instance matching. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9544, pp. 56–72). Springer Verlag. https://doi.org/10.1007/978-3-319-31676-5_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free