Abstract
Running experiments on modern systems like supercomputers, cloud infrastructures or P2P networks became very complex, both technically and methodologically. It is difficult to re-run an experiment or understand its results even with technical background on the technology and methods used. Storing the provenance of experimental data, i.e., storing information about how the results were produced, proved to be a powerful tool to address similar problems in computational natural sciences. In this paper, we (1) survey provenance collection in various domains of computer science, (2) introduce a new classification of provenance types, and (3) sketch a design of a provenance system inspired by this classification.
Cite
CITATION STYLE
Buchert, T., Nussbaum, L., & Gustedt, J. (2015). Towards complete tracking of provenance in experimental distributed systems research. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9523, pp. 604–616). Springer Verlag. https://doi.org/10.1007/978-3-319-27308-2_49
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.