Skip to content

Identifying relevant sources for data linking using a semantic web index

by Andriy Nikolov, Mathieu D'Aquin
CEUR Workshop Proceedings ()
  • ISSN: 16130073


With more data repositories constantly being published on the Web, choosing appropriate data sources to interlink with newly published datasets becomes a non-trivial problem. While catalogs of data repositories and meta-level descriptors such as VoiD provide valuable information to take these decisions, more detailed information about the instances included into repositories is often required to assess the relevance of datasets and the part of the dataset to link to. However, retrieving and processing such information for a potentially large number of datasets is practically unfeasible. In this paper, we examine how using an existing semantic web index can help identifying candidate datasets for linking. We further apply ontology schema matching techniques to rank these candidate datasets and extract the sub-dataset to use for linking, in the form of classes with instances more likely to match the ones of the local dataset.

Author-supplied keywords

Cite this document (BETA)

Readership Statistics

42 Readers on Mendeley
by Discipline
98% Computer Science
2% Engineering
by Academic Status
26% Student > Ph. D. Student
24% Researcher
14% Student > Master
by Country
7% Germany
5% Brazil
5% France

Sign up today - FREE

Mendeley saves you time finding and organizing research. Learn more

  • All your research in one place
  • Add and import papers easily
  • Access it anywhere, anytime

Start using Mendeley in seconds!

Sign up & Download

Already have an account? Sign in