The search for interesting Boolean association rules is an important topic in knowledge discovery in databases. The set of admissible rules for the selected support and confidence thresholds can easily be extracted by algorithms based on support and confidence, such as Apriori. However, they may produce a large number of rules, many of them are uninteresting. One has to resolve a two-tier problem: choosing the measures best suited to the problem at hand, then validating the interesting rules against the selected measures. First, the usual measures suggested in the literature will be reviewed and criteria to appreciate the qualities of these measures will be proposed. Statistical validation of the most interesting rules requests performing a large number of tests. Thus, controlling for false discoveries (type I errors) is of prime importance. An original bootstrap-based validation method is proposed which controls, for a given level, the number of false discoveries. The interest of this method for the selection of interesting association rules will be illustrated by several examples. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Lallich, S., Teytaud, O., & Prudhomme, E. (2007). Association rule Interestingness: Measure and statistical validation. Studies in Computational Intelligence, 43, 251–275. https://doi.org/10.1007/978-3-540-44918-8_11
Mendeley helps you to discover research relevant for your work.