Abstract
Automatic identification and annotation of exon and intron region of gene, from DNA sequences has been an important research area in field of computational biology. Several approaches viz. Hidden Markov Model (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques have extensively and independently been used by various researchers to cater this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundary) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. Radial Basis Function (RBF) SVM kernel is trained using EIIP numeric sequences. Furthermore this was tested on test gene dataset for detection of splice site by window (of 12 residues) shifting. Optimum values of window size, various important parameters of SVM kernel have been optimized for a better accuracy. Receiver Operating Characteristic (ROC) curves have been utilized for displaying the sensitivity rate of the classifier and results showed 94.82% accuracy for splice site detection on test dataset. © 2009 Springer Berlin Heidelberg.
Author supplied keywords
Cite
CITATION STYLE
Varadwaj, P., Purohit, N., & Arora, B. (2009). Detection of splice sites using support vector machine. In Communications in Computer and Information Science (Vol. 40, pp. 493–502). https://doi.org/10.1007/978-3-642-03547-0_47
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.