Evaluation of techniques for classifying biological sequences

Mukund Deshpande; George Karypis

Conference Proceedings

Evaluation of techniques for classifying biological sequences

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2336 417-431

DOI: 10.1007/3-540-47887-6_41

47Citations

26Readers

Get full text

Abstract

In recent years we have witnessed an exponential increase in the amount of biological information, either DNA or protein sequences, that has become available in public databases. This has been followed by an increased interest in developing computational techniques to automatically classify these large volumes of sequence data into various categories corresponding to either their role in the chromosomes, their structure, and/or their function. In this paper we evaluate some of the widely-used sequence classification algorithms and develop a framework for modeling sequences in a fashion so that traditional machine learning algorithms, such as support vector machines, can be applied easily. Our detailed experimental evaluation shows that the SVM-based approaches are able to achieve higher classification accuracy compared to the more traditional sequence classification algorithms such as Markov model based techniques and K-nearest neighbor based approaches.

Cite

CITATION STYLE

APA

Deshpande, M., & Karypis, G. (2002). Evaluation of techniques for classifying biological sequences. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2336, pp. 417–431). Springer Verlag. https://doi.org/10.1007/3-540-47887-6_41

Evaluation of techniques for classifying biological sequences

Abstract

Cite

Register to see more suggestions