Feature analysis and classification of protein secondary structure data

Abstract

In this paper, we investigate feature analysis for predicting the secondary structure of protein sequences using support vector machines (SVMs) and the k-nearest neighbor (kNN) algorithm. We apply feature selection and scaling techniques to obtain a number of distinct feature subsets, each containing different features and each scaled differently. Both the feature selection and the scaling are performed using mutual information (MI). We formulate feature selection and scaling as a combinatorial optimization problem and obtain solutions using a Hopfield-style algorithm. Our experimental results show that feature subset selection improves the performance of both SVM and kNN, while feature scaling is consistently beneficial for kNN. © Springer-Verlag Berlin Heidelberg 2003.
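
To make the idea of MI-based selection and scaling concrete, the following is a minimal sketch and not the authors' method. It assumes discrete features (for example, residues in a sequence window) and ranks them by a plug-in estimate of mutual information with the class label, then reuses the MI values as per-feature weights in a kNN-style distance. The simple ranking stands in for the Hopfield-style combinatorial optimization, which the abstract does not specify; the function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits from paired discrete samples."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def feature_scores(rows, labels):
    """MI between each feature column and the class labels."""
    return [mutual_information([r[j] for r in rows], labels)
            for j in range(len(rows[0]))]


def rank_features(scores):
    """Feature indices sorted by decreasing MI (assumed selection rule)."""
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


def scaled_distance(a, b, weights):
    """Weighted Hamming distance: each mismatch costs that feature's weight."""
    return sum(w for x, y, w in zip(a, b, weights) if x != y)


# Hypothetical toy data: windows of three residues, labels H (helix) / C (coil).
rows = [("A", "L", "G"), ("A", "V", "G"), ("P", "L", "S"), ("P", "V", "S")]
labels = ["H", "H", "C", "C"]

scores = feature_scores(rows, labels)   # MI per feature, used as scaling weights
ranking = rank_features(scores)         # most informative features first
```

In this toy set the middle position carries no information about the label, so it is ranked last and contributes nothing to the scaled distance; a kNN classifier using `scaled_distance` would therefore ignore it.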

Shi, S. Y. M., & Suganthan, P. N. (2003). Feature analysis and classification of protein secondary structure data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2714, 1151–1158. https://doi.org/10.1007/3-540-44989-2_137
