In an mRNA sequence, the prediction of the exact codon where the process of translation starts (Translation Initiation Site - TIS) is a particularly important problem. So far it has been tackled by several researchers that apply various statistical and machine learning techniques, achieving high accuracy levels, often over 90%. In this paper we propose a mahine learning approach that can further improve the prediction accuracy. First, we provide a concise review of the literature in this field. Then we propose a novel feature set. We perform extensive experiments on a publicly available, real world dataset for various vertebrate organisms using a variety of novel features and classification setups. We evaluate our results and compare them with a reference study and show that our approach that involves new features and a combination of the Ribosome Scanning Model with a meta-classifier shows higher accuracy in most cases. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Tzanis, G., Berberidis, C., & Vlahavas, I. (2006). A novel data mining approach for the accurate prediction of translation initiation sites. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4345 LNBI, pp. 92–103). Springer Verlag. https://doi.org/10.1007/11946465_9
Mendeley helps you to discover research relevant for your work.