Dealing with semantic heterogeneity during data integration

Zoubida Kedad; Elisabeth Métais

Conference Proceedings

Dealing with semantic heterogeneity during data integration

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (1999) 1728 325-339

DOI: 10.1007/3-540-47866-3_22

31Citations

12Readers

Get full text

Abstract

Multi-sources information systems, such as data warehouse systems, involve heterogeneous sources. In this paper, we deal with the semantic heterogeneity of the data instances. Problems may occur when confronting sources, each time different level of denominations have been used for the same value, e.g. “vermilion” in one source, and “red” in another. We propose to manage this semantic heterogeneity by using a linguistic dictionary. "Semantic operators" allow a linguistic flexibility in the queries, e.g. two tuples with the values “red” and “vermilion” could match in a semantic join on the “color” attribute. A particularity of our approach is it states the scope of the flexibility by defining classes of equivalent values by the mean of “priority nodes”. They are used as parameters for allowing the user to define the scope of the flexibility in a very natural manner, without specifying any distance.

Cite

CITATION STYLE

APA

Kedad, Z., & Métais, E. (1999). Dealing with semantic heterogeneity during data integration. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1728, pp. 325–339). Springer Verlag. https://doi.org/10.1007/3-540-47866-3_22

Dealing with semantic heterogeneity during data integration

Abstract

Cite

Register to see more suggestions