K-NN for the classification of human cancer samples using the gene expression profiles

1Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The -Nearest Neighbor (k-NN) classifier has been applied to the identification of cancer samples using the gene expression profiles with encouraging results. However, the performance of -NN depends strongly on the distance considered to evaluate the sample proximities. Besides, the choice of a good dissimilarity is a difficult task and depends on the problem at hand. In this chapter, we introduce a method to learn the metric from the data to improve the -NN classifier. To this aim, we consider a regularized version of the kernel alignment algorithm that incorporates a term that penalizes the complexity of the family of distances avoiding overfitting. The error function is optimized using a semidefinite programming approach (SDP). The method proposed has been applied to the challenging problem of cancer identification using the gene expression profiles. Kernel alignment -NN outperforms other metric learning strategies and improves the classical -NN algorithm. © 2010 Springer Science+Business Media, LLC.

Cite

CITATION STYLE

APA

Martín-Merino, M. (2010). K-NN for the classification of human cancer samples using the gene expression profiles. In Advances in Experimental Medicine and Biology (Vol. 680, pp. 157–164). https://doi.org/10.1007/978-1-4419-5913-3_18

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free