Representing Temporal Attributes for Schema Matching

1Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Temporal data are prevalent, where one or several time attributes present. It is challenging to identify the temporal attributes from heterogeneous sources. The reason is that the same attribute could contain distinct values in different time spans, whereas different attributes may have highly similar timestamps and alike values. Existing studies on schema matching seldom explore the temporal information for matching attributes. In this paper, we argue to order the values in an attribute A by some time attribute T as a time series. To learn deep temporal features in the attribute pair (T, A), we devise an auto-encoder to embed the transitions of values in the time series into a vector. The temporal attribute matching (TAM) is thus to evaluate matching distance of two temporal attribute pairs by comparing their transition vectors. We show that computing the optimal matching distance is NP-hard, and present an approximation algorithm. Experiments on real datasets demonstrate the superiority of our proposal in matching temporal attributes compared to the generic schema matching approaches.

Cite

CITATION STYLE

APA

Mei, Y., Song, S., Lee, Y., Park, J., Kim, S. H., & Yi, S. (2020). Representing Temporal Attributes for Schema Matching. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 709–719). Association for Computing Machinery. https://doi.org/10.1145/3394486.3403115

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free