Using SOM-based data binning to support supervised variable selection

Sampsa Laine; Timo Similä

Journal Article

Using SOM-based data binning to support supervised variable selection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3316 172-180

DOI: 10.1007/978-3-540-30499-9_25

3Citations

7Readers

Get full text

Abstract

We propose a robust and understandable algorithm for supervised variable selection. The user defines a problem by manually selecting the variables Y that are used to train a Self-Organizing Map (SOM), which best describes the problem of interest. This is an illustrative problem definition even in multivariate case. The user also defines another set X, which contains variables that may be related to the problem. Our algorithm browses subsets of X and returns the one, which contains most information of the user's problem. We measure information by mapping small areas of the studied subset to the SOM lattice. We return the variable set providing, on average, the most compact mapping. By analysis of public domain data sets and by comparison against other variable selection methods, we illustrate the main benefit of our method: understandability to the common user. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Laine, S., & Similä, T. (2004). Using SOM-based data binning to support supervised variable selection. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3316, 172–180. https://doi.org/10.1007/978-3-540-30499-9_25

Using SOM-based data binning to support supervised variable selection

Abstract

Cite

Register to see more suggestions