Feature analysis and classification of protein secondary structure data

S. Y. M. Shi; P. N. Suganthan

Journal Article

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003) 2714 1151-1158

DOI: 10.1007/3-540-44989-2_137

Abstract

In this paper, we investigate feature analysis for predicting the secondary structure of protein sequences using support vector machines (SVMs) and the k-nearest neighbor (kNN) algorithm. We apply feature selection and scaling techniques to obtain several distinct feature subsets, each containing different features and each scaled differently. Both the feature selection and the scaling are performed using mutual information (MI). We formulate feature selection and scaling as a combinatorial optimization problem and obtain solutions using a Hopfield-style algorithm. Our experimental results show that feature subset selection improves performance for both SVM and kNN, while feature scaling is consistently beneficial for kNN. © Springer-Verlag Berlin Heidelberg 2003.
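
The idea of MI-based feature selection and scaling for kNN can be illustrated with a minimal sketch. It makes several assumptions that are not taken from the paper: features are discrete residues in a short sequence window, MI is estimated from raw counts, selection is a plain ranking by MI, and scaling means weighting a Hamming distance by MI. The Hopfield-style combinatorial optimization mentioned in the abstract is not reproduced here, and all function names and the toy data are hypothetical.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Estimate I(X; Y) in bits from paired discrete samples (count-based)."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi

def mi_weights(samples, labels):
    """One MI score per feature column (here: per window position)."""
    n_features = len(samples[0])
    return [mutual_information([s[j] for s in samples], labels)
            for j in range(n_features)]

def select_top(weights, k):
    """Indices of the k highest-MI features (simple ranking; an assumption,
    not the Hopfield-style optimization described in the abstract)."""
    ranked = sorted(range(len(weights)), key=lambda j: -weights[j])
    return sorted(ranked[:k])

def knn_predict(train, labels, query, weights, k=3):
    """kNN vote using a Hamming distance in which each mismatch costs the
    MI weight of that feature (the 'scaling' step in this sketch)."""
    def dist(a, b):
        return sum(w for w, u, v in zip(weights, a, b) if u != v)
    nearest = sorted(range(len(train)), key=lambda i: dist(train[i], query))[:k]
    return Counter(labels[i] for i in nearest).most_common(1)[0][0]

# Toy data (hypothetical): 3-residue windows with labels H (helix),
# E (strand), C (coil). Only the centre residue carries class information.
train = [("A", "A", "L"), ("L", "A", "A"),
         ("A", "V", "A"), ("L", "V", "L"),
         ("A", "G", "A"), ("L", "G", "L")]
labels = ["H", "H", "E", "E", "C", "C"]

weights = mi_weights(train, labels)
selected = select_top(weights, 1)
prediction = knn_predict(train, labels, ("L", "V", "A"), weights, k=3)
```

In this toy set the flanking positions receive zero weight, so they are dropped by selection and ignored by the weighted distance. This mirrors, in a very reduced form, why scaling can help a distance-based classifier such as kNN.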

Cite

Shi, S. Y. M., & Suganthan, P. N. (2003). Feature analysis and classification of protein secondary structure data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2714, 1151–1158. https://doi.org/10.1007/3-540-44989-2_137
