Supervised evaluation of dataset partitions: Advantages and practice

Sylvain Ferrandiz; Marc Boullé

Conference Proceedings

Supervised evaluation of dataset partitions: Advantages and practice

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3587 LNAI 600-609

DOI: 10.1007/11510888_59

1Citations

3Readers

Get full text

Abstract

In the context of large databases, data preparation takes a greater importance: instances and explanatory attributes have to be carefully selected. In supervised learning, instances partitioning techniques have been developped for univariate representations, leading to precise and comprehensible evaluations of the amount of information contained in an attribute, with respect to the target attribute. Still, the multivariate case remains unstated. In this paper, we describe the partitioning intrinsic convenience for data preparation and we settle a framework for supervised partitioning. A new evaluation criterion of labelled objects partitions, which is based on Minimum Description Length principle, is then set and tested on real and synthetic data sets. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Ferrandiz, S., & Boullé, M. (2005). Supervised evaluation of dataset partitions: Advantages and practice. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3587 LNAI, pp. 600–609). Springer Verlag. https://doi.org/10.1007/11510888_59

Supervised evaluation of dataset partitions: Advantages and practice

Abstract

Cite

Register to see more suggestions