Querying and managing provenance through user views in scientific workflows

111Citations
Citations of this article
81Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Workflow systems have become increasingly popular for managing experiments where many bioinformatics tasks are chained together. Due to the large amount of data generated by these experiments and the need for reproducible results, provenance has become of paramount importance. Workflow systems are therefore starting to provide support for querying provenance. However, the amount of provenance information may be overwhelming, so there is a need for abstraction mechanisms to help users focus on the most relevant information. The technique we pursue is that of "user views." Since bioinformatics tasks may themselves be complex sub-workflows, a user view determines what level of sub-workflow the user can see, and thus what data and tasks are visible in provenance queries. In this paper, we formalize the notion of user views, demonstrate how they can be used in provenance queries, and give an algorithm for generating a user view based on which tasks are relevant for the user. We then describe our prototype and give performance results. Although presented in the context of scientific workflows, the technique applies to other data-oriented workflows. © 2008 IEEE.

Cite

CITATION STYLE

APA

Biton, O., Cohen-Boulakia, S., Davidson, S. B., & Hara, C. S. (2008). Querying and managing provenance through user views in scientific workflows. In Proceedings - International Conference on Data Engineering (pp. 1072–1081). https://doi.org/10.1109/ICDE.2008.4497516

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free