The model we present here formalizes the definition of Data Mining as the process of information generalization. In the model, Data Mining algorithms are defined as generalization operators. We show that only three generalization operators (the classification operator, the clustering operator, and the association operator) are needed to express all Data Mining algorithms for classification, clustering, and association, respectively. The framework of the model allows hybrid systems to be described formally: combinations of classifiers into multi-classifiers, and combinations of clustering with classification. We use our framework to show that classification, clustering, and association analysis fall into three different generalization categories.
Menasalvas, E., & Wasilewska, A. (2005). Data Mining as Generalization: A Formal Model. In Foundations and Novel Approaches in Data Mining (pp. 99–126). Springer-Verlag. https://doi.org/10.1007/11539827_6