Advances and challenges for scalable provenance in stream processing systems

Archan Misra; Marion Blount; Anastasios Kementsietsidis; Daby Sow; Min Wang

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5272, pp. 253-265 (2008)

DOI: 10.1007/978-3-540-89965-5_26

Abstract

While data provenance is a well-studied topic in both database and workflow systems, its support within stream processing systems presents a new set of challenges. Part of the challenge is the high stream event rate and the low processing latency requirements imposed by many streaming applications. For example, emerging streaming applications in healthcare or finance call for data provenance, as illustrated in the Century stream processing infrastructure that we are building for supporting online healthcare analytics. At any time, given an output data element (e.g., a medical alert) generated by Century, the system must be able to retrieve the input and intermediate data elements that led to its generation. In this paper, we describe the requirements behind our initial implementation of Century’s provenance subsystem. We then analyze its strengths and limitations and propose a new provenance architecture to address some of these limitations. The paper also includes a discussion of the open challenges in this area.

Cite

Misra, A., Blount, M., Kementsietsidis, A., Sow, D., & Wang, M. (2008). Advances and challenges for scalable provenance in stream processing systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5272, pp. 253–265). Springer Verlag. https://doi.org/10.1007/978-3-540-89965-5_26