Dealing with semantic heterogeneity during data integration

Abstract

Multi-source information systems, such as data warehouse systems, involve heterogeneous sources. In this paper, we deal with the semantic heterogeneity of the data instances. Problems may occur when comparing sources whenever different levels of denomination have been used for the same value, e.g. “vermilion” in one source and “red” in another. We propose to manage this semantic heterogeneity by using a linguistic dictionary. “Semantic operators” allow linguistic flexibility in queries, e.g. two tuples with the values “red” and “vermilion” could match in a semantic join on the “color” attribute. A particularity of our approach is that it states the scope of the flexibility by defining classes of equivalent values by means of “priority nodes”. These nodes are used as parameters that let the user define the scope of the flexibility in a very natural manner, without specifying any distance.
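The abstract does not give the operators' definitions, so the following is only a minimal sketch of how a semantic join scoped by priority nodes might look. It assumes the linguistic dictionary can be reduced to a term-to-hypernym map and that a value's equivalence class is the nearest priority node at or above it; the names `HYPERNYM`, `class_of`, and `semantic_join`, and the sample data, are hypothetical and not taken from the paper.

```python
# Toy stand-in for the linguistic dictionary (hypothetical data):
# each term maps to its more general term (hypernym).
HYPERNYM = {
    "vermilion": "red",
    "scarlet": "red",
    "navy": "blue",
    "red": "color",
    "blue": "color",
}


def class_of(value, priority_nodes, hypernym=HYPERNYM):
    """Return the nearest priority node at or above `value`.

    Assumption: values with no priority node above them form their own
    class, so they only match themselves.
    """
    node = value
    seen = set()  # guards against cycles in the dictionary
    while node is not None and node not in seen:
        if node in priority_nodes:
            return node
        seen.add(node)
        node = hypernym.get(node)
    return value


def semantic_join(left, right, attr, priority_nodes):
    """Pair rows whose `attr` values fall in the same equivalence class."""
    index = {}
    for row in right:
        index.setdefault(class_of(row[attr], priority_nodes), []).append(row)
    return [
        (l_row, r_row)
        for l_row in left
        for r_row in index.get(class_of(l_row[attr], priority_nodes), [])
    ]


source_a = [
    {"item": "scarf", "color": "vermilion"},
    {"item": "hat", "color": "navy"},
]
source_b = [
    {"id": 1, "color": "red"},
    {"id": 2, "color": "blue"},
    {"id": 3, "color": "scarlet"},
]

# Priority nodes "red" and "blue": shades of red match each other,
# but never match shades of blue.
pairs = semantic_join(source_a, source_b, "color", {"red", "blue"})
for l_row, r_row in pairs:
    print(l_row["item"], "<->", r_row["id"])
```

In this sketch the user widens or narrows the flexibility only by choosing the set of priority nodes: passing `{"color"}` would place every value in one class, while an empty set falls back to exact matching. No numeric distance threshold is involved, which is the property the abstract emphasizes.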

Cite

Kedad, Z., & Métais, E. (1999). Dealing with semantic heterogeneity during data integration. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1728, pp. 325–339). Springer Verlag. https://doi.org/10.1007/3-540-47866-3_22
