K-NN for the classification of human cancer samples using the gene expression profiles

Manuel Martín-Merino

Conference Proceedings

K-NN for the classification of human cancer samples using the gene expression profiles

Martín-Merino M

Advances in Experimental Medicine and Biology (2010) 680 157-164

DOI: 10.1007/978-1-4419-5913-3_18

1Citations

9Readers

Get full text

Abstract

The -Nearest Neighbor (k-NN) classifier has been applied to the identification of cancer samples using the gene expression profiles with encouraging results. However, the performance of -NN depends strongly on the distance considered to evaluate the sample proximities. Besides, the choice of a good dissimilarity is a difficult task and depends on the problem at hand. In this chapter, we introduce a method to learn the metric from the data to improve the -NN classifier. To this aim, we consider a regularized version of the kernel alignment algorithm that incorporates a term that penalizes the complexity of the family of distances avoiding overfitting. The error function is optimized using a semidefinite programming approach (SDP). The method proposed has been applied to the challenging problem of cancer identification using the gene expression profiles. Kernel alignment -NN outperforms other metric learning strategies and improves the classical -NN algorithm. © 2010 Springer Science+Business Media, LLC.

Author supplied keywords

Cite

CITATION STYLE

APA

Martín-Merino, M. (2010). K-NN for the classification of human cancer samples using the gene expression profiles. In Advances in Experimental Medicine and Biology (Vol. 680, pp. 157–164). https://doi.org/10.1007/978-1-4419-5913-3_18

K-NN for the classification of human cancer samples using the gene expression profiles

Abstract

Author supplied keywords

Cite

Register to see more suggestions