A noisy 10GB provenance database

8Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Provenance of scientific data is a key piece of the metadata record for the data's ongoing discovery and reuse. Provenance collection systems capture provenance on the fly, however, the protocol between application and provenance tool may not be reliable. Consequently, the provenance record can be partial, partitioned, and simply inaccurate. We use a workflow emulator that models faults to construct a large 10GB database of provenance that we know is noisy (that is, has errors). We discuss the process of generating the provenance database, and show early results on the kinds of provenance analysis enabled by the large provenance. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Cheah, Y. W., Plale, B., Kendall-Morwick, J., Leake, D., & Ramakrishnan, L. (2012). A noisy 10GB provenance database. In Lecture Notes in Business Information Processing (Vol. 100 LNBIP, pp. 370–381). Springer Verlag. https://doi.org/10.1007/978-3-642-28115-0_35

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free