Abstract provenance graphs: Anticipating and exploiting schema-level data provenance

Daniel Zinn; Bertram Ludäscher

Conference ProceedingsOPEN ACCESS

Abstract provenance graphs: Anticipating and exploiting schema-level data provenance

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6378 LNCS 206-215

DOI: 10.1007/978-3-642-17819-1_23

4Citations

28Readers

Abstract

Provenance graphs capture flow and dependency information recorded during scientific workflow runs, which can be used subsequently to interpret, validate, and debug workflow results. In this paper, we propose the new concept of Abstract Provenance Graphs (APGs). APGs are created via static analysis of a configured workflow W and input data schema, i.e., before W is actually executed. They summarize all possible provenance graphs the workflow W can create with input data of type τ, that is, for each input ν ∈ τ there exists a graph homomorphism ℋν between the concrete and abstract provenance graph. APGs are helpful during workflow construction since (1) they make certain workflow design-bugs (e.g., selecting none or wrong input data for the actors) easy to spot; and (2) show the evolution of the overall data organization of a workflow. Moreover, after workflows have been run, APGs can be used to validate concrete provenance graphs. A more detailed version of this work is available as [14]. © 2010 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Zinn, D., & Ludäscher, B. (2010). Abstract provenance graphs: Anticipating and exploiting schema-level data provenance. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6378 LNCS, pp. 206–215). https://doi.org/10.1007/978-3-642-17819-1_23

Abstract provenance graphs: Anticipating and exploiting schema-level data provenance

Abstract

Cite

Register to see more suggestions