Mining frequent sequences using itemset-based extension

Ma Zhixin; Xu Yusheng; Tharam S. Dillon

Conference Proceedings

Mining frequent sequences using itemset-based extension

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 8178 LNAI 1-9

DOI: 10.1007/978-3-319-04048-6_1

0Citations

6Readers

Get full text

Abstract

In this paper, we systematically explore an itemset-based extension approach for generating candidate sequence which contributes to a better and more straightforward search space traversal performance than traditional item-based extension approach. Based on this candidate generation approach, we present FINDER, a novel algorithm for discovering the set of all frequent sequences. FINDER is composed of two separated steps. In the first step, all frequent itemsets are discovered and we can get great benefit from existing efficient itemset mining algorithms. In the second step, all frequent sequences with at least two frequent itemsets are detected by combining depth-first search and itemset-based extension candidate generation together. A vertical bitmap data representation is adopted for rapidly support counting reason. Several pruning strategies are used to reduce the search space and minimize cost of computation. An extensive set of experiments demonstrate the effectiveness and the linear scalability of proposed algorithm. © Springer International Publishing Switzerland 2013.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhixin, M., Yusheng, X., & Dillon, T. S. (2013). Mining frequent sequences using itemset-based extension. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8178 LNAI, pp. 1–9). Springer Verlag. https://doi.org/10.1007/978-3-319-04048-6_1

Mining frequent sequences using itemset-based extension

Abstract

Author supplied keywords

Cite

Register to see more suggestions