A lossless data reduction for mining constrained patterns in n-ary relations

Gabriel Poesia; Loïc Cerf

Conference ProceedingsOPEN ACCESS

A lossless data reduction for mining constrained patterns in n-ary relations

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8725 LNAI(PART 2) 581-596

DOI: 10.1007/978-3-662-44851-9_37

2Citations

2Readers

Abstract

Given a binary relation, listing the itemsets takes exponential time. The problem grows worse when searching for analog patterns defined in n-ary relations. However, real-life relations are sparse and, with a greater number n of dimensions, they tend to be even sparser. Moreover, not all itemsets are searched. Only those satisfying some userdefined constraints, such as minimal size constraints. This article proposes to exploit together the sparsity of the relation and the presence of constraints satisfying a common property, the monotonicity w.r.t. one dimension. It details a pre-processing step to identify and erase n-tuples whose removal does not change the collection of patterns to be discovered. That reduction of the relation is achieved in a time and a space that is linear in the number of n-tuples. Experiments on two real-life datasets show that, whatever the algorithm used afterward to actually list the patterns, the pre-process allows to lower the overall running time by a factor typically ranging from 10 to 100. © 2014 Springer-Verlag.

Cite

CITATION STYLE

APA

Poesia, G., & Cerf, L. (2014). A lossless data reduction for mining constrained patterns in n-ary relations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8725 LNAI, pp. 581–596). Springer Verlag. https://doi.org/10.1007/978-3-662-44851-9_37

A lossless data reduction for mining constrained patterns in n-ary relations

Abstract

Cite

Register to see more suggestions