Provenance and the price of identity

Adriane Chapman; H. V. Jagadish

Conference ProceedingsOPEN ACCESS

Provenance and the price of identity

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 5272 106-119

DOI: 10.1007/978-3-540-89965-5_12

4Citations

16Readers

Abstract

As developers acknowledge that provenance is essential, more and more datasets are attempting to keep provenance records describing how they were created. Some of these datasets are constructed using workflows, others cobble together processes and applications to manipulate the data. While the provenance needs are the same, the inputs and set of processes used must be kept, the identity needs are very different. We outline several identification strategies that can be used for data manipulation outside of workflows.We evaluate these strategies in terms of time to create and store identity, and the space needed to keep this information. Additionally, we discuss the strengths and weaknesses of each strategy.

Cite

CITATION STYLE

APA

Chapman, A., & Jagadish, H. V. (2008). Provenance and the price of identity. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5272, pp. 106–119). Springer Verlag. https://doi.org/10.1007/978-3-540-89965-5_12

Provenance and the price of identity

Abstract

Cite

Register to see more suggestions