Sequential Clustering for Event Sequences and Its Impact on Next Process Step Prediction

12Citations
Citations of this article
27Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Next step prediction is an important problem in process analytics and it can be used in process monitoring to preempt failure in business processes. We are using logfiles from a workflow system that record the sequential execution of business processes. Each process execution results in a timestamped event. The main issue of analysing such event sequences is that they can be very diverse. Models that can effectively handle diverse sequences without losing the sequential nature of the data are desired. We propose an approach which clusters event sequences. Each cluster consists of similar sequences and the challenge is to identify a similarity measure that can cope with the sequential nature of the data. After clustering we build individual predictive models for each group. This strategy addresses both the sequential and diverse characteristics of our data. We first employ K-means and extent it into a categorical-sequential clustering algorithm by combining it with sequential alignment. Finally, we treat each resulting cluster by building individual Markov models of different orders, expecting that the representative characteristics of each cluster are captured. © Springer International Publishing Switzerland 2014.

Cite

CITATION STYLE

APA

Le, M., Nauck, D., Gabrys, B., & Martin, T. (2014). Sequential Clustering for Event Sequences and Its Impact on Next Process Step Prediction. In Communications in Computer and Information Science (Vol. 442 CCIS, pp. 168–178). Springer Verlag. https://doi.org/10.1007/978-3-319-08795-5_18

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free