Enabling provenance on large scale e-Science applications

Miguel Branco; Luc Moreau

Conference ProceedingsOPEN ACCESS

Enabling provenance on large scale e-Science applications

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4145 LNCS 55-63

DOI: 10.1007/11890850_7

6Citations

16Readers

Abstract

Large-scale e-Science experiments present unprecedented data handling requirements with their multi-petabyte data storages. Complex software applications, such as the ATLAS High Energy Physics experiment at CERN, run throughout Grid computing sites around the world in a distributed environment, with scientists performing concurrent analysis on data and producing new data products shared among the collaboration. In this paper, we introduce a multi-phase infrastructure to achieve data provenance for an e-Science experiment. We propose an infrastructure to integrate provenance onto an existing legacy application with strong emphasis on scalability and explore the relationship between provenance and metadata introducing a model where data provenance is made available as metadata through a separate reasoning phase. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Branco, M., & Moreau, L. (2006). Enabling provenance on large scale e-Science applications. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4145 LNCS, pp. 55–63). Springer Verlag. https://doi.org/10.1007/11890850_7

Readers' Seniority

PhD / Post grad / Masters / Doc 7

54%

Researcher 4

31%

Professor / Associate Prof. 1

Lecturer / Post doc 1

Readers' Discipline

Computer Science 14

88%

Social Sciences 1

Mathematics 1

Enabling provenance on large scale e-Science applications

Abstract

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline