Human splicing site prediction is important for identifying the complete structure of genes in Human genomes. Machine learning method is capable of distinguishing the different splice sites in genes. For machine learning method, feature extraction is a key step in dealing with the problem of splicing site identification. Encoding schema is a widely used method to encode gene sequences by feature vectors. However, this method ignores the information of the period-3 behavior of the splice sites and sequential information in the sequence. In this paper, a new feature extraction method, based on orthogonal encoding, codon usage and the sequential information, is proposed to map splice site sequences into feature vectors. Classification is performed using a Support Vector Machine (SVM) method. The experimental results show that the new method can predict human splice sites with high classification accuracy. © 2012 Springer-Verlag.
CITATION STYLE
Wei, D., Zhuang, W., Jiang, Q., & Wei, Y. (2012). A new classification method for human gene splice site prediction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7231 LNCS, pp. 121–130). https://doi.org/10.1007/978-3-642-29361-0_16
Mendeley helps you to discover research relevant for your work.