Constraint-based mining of sequential patterns over datasets with consecutive repetitions

Marion Leleu; Christophe Rigotti; Jean François Boulicaut; Guillaume Euvrard

Conference ProceedingsOPEN ACCESS

Constraint-based mining of sequential patterns over datasets with consecutive repetitions

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2003) 2838 303-314

DOI: 10.1007/978-3-540-39804-2_28

26Citations

22Readers

Abstract

Constraint-based mining of sequential patterns is an active research area motivated by many application domains. In practice, the real sequence datasets can present consecutive repetitions of symbols (e.g., DNA sequences, discretized stock market data) that can lead to a very important consumption of resources during the extraction of patterns that can turn even efficient algorithms to become unusable. We propose a constraint-based mining algorithm using an approach that enables to compact these consecutive repetitions, reducing drastically the amount of data to process and speeding-up the extraction time. The technique introduced in this paper allows to retain the advantages of existing state-of-the-art algorithms based on the notion of occurrence lists, while permitting to extend their application fields to datasets containing consecutive repetitions. We analyze the benefits obtained using synthetic datasets, and show that the approach is of practical interest on real datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Leleu, M., Rigotti, C., Boulicaut, J. F., & Euvrard, G. (2003). Constraint-based mining of sequential patterns over datasets with consecutive repetitions. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2838, pp. 303–314). Springer Verlag. https://doi.org/10.1007/978-3-540-39804-2_28

Constraint-based mining of sequential patterns over datasets with consecutive repetitions

Abstract

Author supplied keywords

Cite

Register to see more suggestions