Supervised selection of dynamic features, with an application to telecommunication data preparation

Sylvain Ferrandiz; Marc Boullé

Conference Proceedings

Supervised selection of dynamic features, with an application to telecommunication data preparation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4065 LNAI 239-249

DOI: 10.1007/11790853_19

1Citations

5Readers

Get full text

Abstract

In the field of data mining, data preparation has more and more in common with a bottleneck. Indeed, collecting and storing data becomes cheaper while modelling costs remain unchanged. As a result, feature selection is now usually performed. In the data preparation step, selection often relies on feature ranking. In the supervised classification context, ranking is based on the information that the explanatory feature brings on the target categorical attribute. With the increasing presence in the database of feature measured over time, i.e. dynamic features, new supervised ranking methods have to be designed. In this paper, we propose a new method to evaluate dynamic features, which is derived from a probabilistic criterion. The criterion is non-parametric and handles automatically the problem of overfitting the data. The resulting evaluation produces reliable results. Furthermore, the design of the criterion relies on an understandable and simple approach, This allows to provide meaningful visualization of the evaluation, in addition to the computed score. The advantages of the new method are illustrated on a telecommunication dataset. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Ferrandiz, S., & Boullé, M. (2006). Supervised selection of dynamic features, with an application to telecommunication data preparation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4065 LNAI, pp. 239–249). Springer Verlag. https://doi.org/10.1007/11790853_19

Supervised selection of dynamic features, with an application to telecommunication data preparation

Abstract

Cite

Register to see more suggestions