Metadata Discovery Using Data Sampling and Exploratory Data Analysis

Hiba Khalid; Robert Wrembel; Esteban Zimányi

Conference Proceedings

Metadata Discovery Using Data Sampling and Exploratory Data Analysis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11815 LNCS 106-120

DOI: 10.1007/978-3-030-32065-2_8

0Citations

4Readers

Get full text

Abstract

Metadata discovery is a prominent contributor towards understanding the semantics of data, relationships between data, and fundamental data features for the purpose of data management, query processing, and data integration. Metadata discovery is constantly evolving with the help of data profiling and manual annotators, resulting in various good quality data profiling techniques and tools. Even though, there are different metadata standards specified for distinct fields such as finance, biology, experimental physics, medicine, there is no generic method that discovers metadata automatically or presents them in a unified way. In this paper, we present a technique for discovering and generating metadata for data sources that do not provide explicit metadata. To this end, we apply exploratory data analysis to produce two kinds of metadata, i.e., administrative and technical, in order to find similarities between resources, w.r.t. their structures and contents. Our technique was evaluated experimentally. The results show that the technique allows to identify similar data sources and compute their similarity measures.

Author supplied keywords

Cite

CITATION STYLE

APA

Khalid, H., Wrembel, R., & Zimányi, E. (2019). Metadata Discovery Using Data Sampling and Exploratory Data Analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11815 LNCS, pp. 106–120). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-32065-2_8

Metadata Discovery Using Data Sampling and Exploratory Data Analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions