Abstract
A novel approach to feature selection from unlabeled vector data is presented. It is based on the reconstruction of original data relationships in an auxiliary space with either weighted or omitted features. Feature weighting, on one hand, is related to the return forces of factors in a parametric data similarity measure as response to disturbance of their optimum values. Feature omission, on the other hand, inducing measurable loss of reconstruction quality, is realized in an iterative greedy way. The proposed framework allows to apply custom data similarity measures. Here, adaptive Euclidean distance and adaptive Pearson correlation are considered, the former serving as standard reference, the latter being usefully for intensity data. Results of the different strategies are given for chromatography and gene expression data. Springer-Verlag Berlin Heidelberg 2006.
Author supplied keywords
Cite
CITATION STYLE
Strickert, M., Sreenivasulu, N., Peterek, S., Weschke, W., Mock, H. P., & Seiffert, U. (2006). Unsupervised feature selection for biomarker identification in chromatography and gene expression data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4087 LNAI, pp. 274–285). Springer Verlag. https://doi.org/10.1007/11829898_25
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.