In the context of large databases, data preparation takes a greater importance: instances and explanatory attributes have to be carefully selected. In supervised learning, instances partitioning techniques have been developped for univariate representations, leading to precise and comprehensible evaluations of the amount of information contained in an attribute, with respect to the target attribute. Still, the multivariate case remains unstated. In this paper, we describe the partitioning intrinsic convenience for data preparation and we settle a framework for supervised partitioning. A new evaluation criterion of labelled objects partitions, which is based on Minimum Description Length principle, is then set and tested on real and synthetic data sets. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Ferrandiz, S., & Boullé, M. (2005). Supervised evaluation of dataset partitions: Advantages and practice. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3587 LNAI, pp. 600–609). Springer Verlag. https://doi.org/10.1007/11510888_59
Mendeley helps you to discover research relevant for your work.