CAMLS: A constraint-based Apriori algorithm for mining long sequences

Abstract

Mining sequential patterns is a key objective in the field of data mining due to its wide range of applications. Given a database of sequences, the challenge is to identify patterns that appear frequently in different sequences. Well-known algorithms have proved to be efficient; however, they do not perform well when mining databases that have long frequent sequences. We present CAMLS, Constraint-based Apriori Mining of Long Sequences, an efficient algorithm for mining long sequential patterns under constraints. CAMLS is based on the Apriori property and consists of two phases, event-wise and sequence-wise, each of which employs an iterative process of candidate generation followed by frequency testing. The separation into these two phases allows us to: (i) introduce a novel candidate pruning strategy that increases the efficiency of the mining process and (ii) easily incorporate considerations of intra-event and inter-event constraints. Experiments on both synthetic and real datasets show that CAMLS outperforms previous algorithms when mining long sequences. © Springer-Verlag Berlin Heidelberg 2010.
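
The two-phase structure described in the abstract can be illustrated with a minimal sketch. It is a generic Apriori-style miner, not the CAMLS implementation: it assumes each sequence is a list of events, each event is a set of items, and support is the number of sequences containing a pattern. All function names (`is_subsequence`, `support`, `frequent_events`, `frequent_sequences`) are hypothetical, and the sketch omits the intra-event and inter-event constraints as well as the paper's own pruning strategy.

```python
from itertools import combinations


def is_subsequence(pattern, sequence):
    """True if every event of `pattern` is a subset of a distinct event
    of `sequence`, with the original order preserved."""
    pos = 0
    for event in pattern:
        while pos < len(sequence) and not event <= sequence[pos]:
            pos += 1
        if pos == len(sequence):
            return False
        pos += 1
    return True


def support(pattern, database):
    """Number of sequences in `database` that contain `pattern`."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def frequent_events(database, min_support):
    """Event-wise phase (simplified): level-wise search for frequent itemsets."""
    items = sorted({item for seq in database for event in seq for item in event})
    level = [frozenset([item]) for item in items]
    level = [e for e in level if support((e,), database) >= min_support]
    frequent = []
    while level:
        frequent.extend(level)
        known = set(level)
        size = len(level[0]) + 1
        # Candidate generation: join two frequent events that differ in one item.
        candidates = {a | b for a, b in combinations(level, 2) if len(a | b) == size}
        # Apriori pruning: every (size - 1)-subset must itself be frequent.
        candidates = [c for c in candidates if all(c - {i} in known for i in c)]
        # Frequency testing.
        level = [c for c in candidates if support((c,), database) >= min_support]
    return frequent


def frequent_sequences(database, min_support):
    """Sequence-wise phase (simplified): grow sequences of frequent events."""
    level = [(e,) for e in frequent_events(database, min_support)]
    frequent = []
    while level:
        frequent.extend(level)
        # Candidate generation: join p and q when p without its first event
        # equals q without its last event, so both the prefix and the suffix
        # of each candidate are already known to be frequent.
        candidates = [p + (q[-1],) for p in level for q in level if p[1:] == q[:-1]]
        # Frequency testing.
        level = [c for c in candidates if support(c, database) >= min_support]
    return frequent
```

Calling `frequent_sequences(database, min_support)` returns every frequent pattern as a tuple of `frozenset` events. Keeping the two phases in separate functions mirrors the separation the abstract describes: a constraint on items within one event would be checked in the first function, and a constraint between events in the second.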

Cite

Gonen, Y., Gal-Oz, N., Yahalom, R., & Gudes, E. (2010). CAMLS: A constraint-based Apriori algorithm for mining long sequences. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5981 LNCS, pp. 63–77). https://doi.org/10.1007/978-3-642-12026-8_7