Process discovery is widely used in business process intelligence to reconstruct process models from event logs recorded by information systems. With the increase of complexity and flexibility of processes, it is getting more and more challenging for discovery algorithms to generate accurate and comprehensive models. Trace clustering aims to overcome this issue by splitting event logs into smaller behavioral similar sub-logs. From these sub-logs more accurate and comprehensive process models can be reconstructed. In this paper, we propose a novel clustering approach that uses frequent itemset mining on the case attributes to also reveal relationships on the data perspective. Our approach includes this additional knowledge as well as optimizes the fitness of the underlying process models of each cluster to generate accurate clustering results. We compare our method with six other clustering methods and evaluate our approach using synthetic and real-life event logs.
CITATION STYLE
Seeliger, A., Nolle, T., & Mühlhäuser, M. (2018). Finding structure in the unstructured: Hybrid feature set clustering for process discovery. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11080 LNCS, pp. 288–304). Springer Verlag. https://doi.org/10.1007/978-3-319-98648-7_17
Mendeley helps you to discover research relevant for your work.