Abstract provenance graphs: Anticipating and exploiting schema-level data provenance

4Citations
Citations of this article
28Readers
Mendeley users who have this article in their library.

Abstract

Provenance graphs capture flow and dependency information recorded during scientific workflow runs, which can be used subsequently to interpret, validate, and debug workflow results. In this paper, we propose the new concept of Abstract Provenance Graphs (APGs). APGs are created via static analysis of a configured workflow W and input data schema, i.e., before W is actually executed. They summarize all possible provenance graphs the workflow W can create with input data of type τ, that is, for each input ν ∈ τ there exists a graph homomorphism ℋν between the concrete and abstract provenance graph. APGs are helpful during workflow construction since (1) they make certain workflow design-bugs (e.g., selecting none or wrong input data for the actors) easy to spot; and (2) show the evolution of the overall data organization of a workflow. Moreover, after workflows have been run, APGs can be used to validate concrete provenance graphs. A more detailed version of this work is available as [14]. © 2010 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Zinn, D., & Ludäscher, B. (2010). Abstract provenance graphs: Anticipating and exploiting schema-level data provenance. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6378 LNCS, pp. 206–215). https://doi.org/10.1007/978-3-642-17819-1_23

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free