In many high-dimensional microarray classification problems, an important task is to identify subsets of genes that best discriminate the classes. Nevertheless, existing gene selection methods for microarray classification cannot identify which classes are discriminable by these selected genes. In this paper, we propose an improved linear discriminant analysis (LDA) method that simultaneously selects important genes and identifies the discriminable classes. Specifically, a pairwise fusion penalty for LDA was used to shrink the differences of the class centroids in pairs for each variable and fuse the centroids of indiscriminable classes altogether. The numerical results in analyzing 2 gene expression profiles demonstrate the proposed approach help improve the interpretation of important genes in microarray classification problems. © 2010 The Author.
CITATION STYLE
Guo, J. (2010). Simultaneous variable selection and class fusion for high-dimensional linear discriminant analysis. Biostatistics, 11(4), 599–608. https://doi.org/10.1093/biostatistics/kxq023
Mendeley helps you to discover research relevant for your work.