We describe a Bayesian inference method for the identification of protein coding regions (active or residual) in DNA or RNA sequences. Its main feature is the computation of the conditional and a priori probabilities required in Bayes's formula by factoring each event (possible annotation) for a nucleotide string into the concatenation of shorter events, believed to be independent. The factoring allows us to obtain fast but reliable estimates for these parameters from readily available databases; whereas the probability estimation for unfactored events would require databases and tables of astronomical size. Promising results were obtained in tests with natural and artificial genomes. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Capua, R. O., Leitão, H. C. D. G., & Stolfi, J. (2007). Bayesian detection of coding regions in DNA/RNA sequences through event factoring. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4756 LNCS, pp. 624–634). Springer Verlag. https://doi.org/10.1007/978-3-540-76725-1_65
Mendeley helps you to discover research relevant for your work.