Feature extraction using clustering of protein

Isis Bonet; Yvan Saeys; Ricardo Grau Ábalo; María M. García; Robersy Sanchez; Yves Van De Peer

Conference ProceedingsOPEN ACCESS

Feature extraction using clustering of protein

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4225 LNCS 614-623

DOI: 10.1007/11892755_64

6Citations

9Readers

Abstract

In this paper we investigate the usage of a clustering algorithm as a feature extraction technique to find new features to represent the protein sequence. In particular, our work focuses on the prediction of HIV protease resistance to drugs. We use a biologically motivated similarity function based on the contact energy of the amino acid and the position in the sequence. The performance measure was computed taking into account the clustering reliability and the classification validity. An SVM using 10-fold crossvalidation and the k-means algorithm were used for classification and clustering respectively. The best results were obtained by reducing an initial set of 99 features to a lower dimensional feature set of 36-66 features. © Springer-Verlag Berlin Heidelberg 2006.

Author supplied keywords

Cite

CITATION STYLE

APA

Bonet, I., Saeys, Y., Ábalo, R. G., García, M. M., Sanchez, R., & Van De Peer, Y. (2006). Feature extraction using clustering of protein. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4225 LNCS, pp. 614–623). Springer Verlag. https://doi.org/10.1007/11892755_64

Feature extraction using clustering of protein

Abstract

Author supplied keywords

Cite

Register to see more suggestions