How much event data is enough? A statistical framework for process discovery


Abstract

With the increasing availability of event logs recorded from business processes, the scalability of techniques that discover a process model from such logs becomes a performance bottleneck. In particular, exploratory analysis that investigates manifold parameter settings of discovery algorithms, potentially using a software-as-a-service tool, relies on fast response times. However, common approaches to process model discovery parse and analyse all available event data, even though a small fraction of a log may already yield a high-quality model. In this paper, we therefore present a framework for process discovery that statistically pre-processes an event log and significantly reduces its size by means of sampling. It thereby lowers the runtime and memory footprint of process discovery algorithms, while providing guarantees on the introduced sampling error. Experiments with two public real-world event logs reveal that our approach speeds up state-of-the-art discovery algorithms by a factor of up to 20.
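
The full statistical framework is defined in the paper itself; as a rough illustration of the log-sampling idea, the Python sketch below draws traces in random batches and stops once several consecutive batches add no new directly-follows pairs. The function name, the stopping rule, and the parameters delta and batch_size are assumptions made purely for illustration and do not reproduce the error guarantees derived in the paper.

    import random

    def sample_log(traces, delta=0.05, batch_size=50, seed=42):
        """Illustrative sketch (not the paper's criterion): sample traces
        until consecutive batches stop contributing new directly-follows
        pairs, suggesting that further traces are unlikely to change the
        discovered model much.

        `traces` is a list of traces, each a list of activity labels.
        """
        rng = random.Random(seed)
        shuffled = traces[:]
        rng.shuffle(shuffled)

        sample, seen_pairs = [], set()
        stable_batches = 0
        # Heuristic: the smaller delta, the more "quiet" batches we require.
        required_stable = max(1, int(1.0 / (delta * batch_size)))

        for start in range(0, len(shuffled), batch_size):
            batch = shuffled[start:start + batch_size]
            new_pairs = {
                (a, b)
                for trace in batch
                for a, b in zip(trace, trace[1:])
            } - seen_pairs

            sample.extend(batch)
            seen_pairs |= new_pairs

            # Stop early once several consecutive batches add no new behaviour.
            stable_batches = stable_batches + 1 if not new_pairs else 0
            if stable_batches >= required_stable:
                break

        return sample

A discovery algorithm would then be run on the returned sample rather than on the full log, which is where the runtime and memory savings reported in the paper come from.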

Cite

Bauer, M., Senderovich, A., Gal, A., Grunske, L., & Weidlich, M. (2018). How much event data is enough? A statistical framework for process discovery. In Lecture Notes in Computer Science (Vol. 10816 LNCS, pp. 239–256). Springer. https://doi.org/10.1007/978-3-319-91563-0_15
