On the Record: Provenance in Large Scale, Open, Distributed Systems
Abstract
Scientist increasingly rely on large scale, open distributed systems such as Grids in order to investigate a wide variety of research questions. In such systems, it is difficult to know exactly how a result is generated, however, such information is necessary for the scientific process. Therefore, it is vital that these systems have an automated mechanism for documenting process from which a result?s provenance can be retrieved. The provenance of a result is the process that led to that result. This thesis defines what provenance is for distributed systems based on the Service Oriented Architecture model. It presents a structure for the documentation of process from which the provenance of a result can be retrieved. Based on this structure, a set of patterns and a protocol are presented for recording assertions about processes in Service Oriented Architecture-based systems. An implementation of these specifications is then detailed followed by an evaluation of that implementation. Finally, a direction for future work is outlined. esse sequitur operari being follows functioning
Sign up today - FREE
Mendeley saves you time finding and organizing research. Learn more
- All your research in one place
- Add and import papers easily
- Access it anywhere, anytime

