Exact computation of pattern probabilities in random sequences generated by Markov chains


Abstract

Observed patterns in macromolecular sequences are often treated as words and compared with their probabilities of occurring in random sequences. Calculation of these probabilities, however, often lacks rigour. We have developed an algorithm for exact computation of such probabilities for stochastic sequences that follow a Markov chain model. The method applies to the case in which a random sequence contains one of two given patterns P and Q, or both simultaneously. Another application yields the probability function P(x) that a sequence contains pattern P exactly x times. An application to patterns that include wild-card characters yields probabilities for homonucleotide clusters of a given length. We show that the probability of multiple runs of single nucleotides in the SV40 genome is in accordance with the dinucleotide composition of the sequence, although it is in conflict with the mononucleotide composition. © 1989 IRL Press.
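The abstract does not spell out the algorithm itself, so the following is only a minimal sketch of one standard way to obtain the probability function P(x) exactly for a first-order Markov chain. It is not the authors' method. It tracks, position by position, the joint distribution of the last character, the state of a KMP-style matching automaton, and the number of occurrences seen so far. The names `build_automaton`, `occurrence_distribution`, `init` and `trans` are hypothetical, overlapping occurrences are counted, and wild-card characters are not handled.

```python
def build_automaton(pattern, alphabet):
    """KMP-style automaton. State s means: the longest prefix of `pattern`
    that is a suffix of the text read so far has length s."""
    m = len(pattern)
    # fail[i] = length of the longest proper border of pattern[:i]
    fail = [0] * (m + 1)
    k = 0
    for i in range(1, m):
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k]
        if pattern[i] == pattern[k]:
            k += 1
        fail[i + 1] = k
    delta = [dict() for _ in range(m + 1)]
    for s in range(m + 1):
        for c in alphabet:
            if s < m and pattern[s] == c:
                delta[s][c] = s + 1
            elif s == 0:
                delta[s][c] = 0
            else:
                # fall back to the border state, which is already filled in
                delta[s][c] = delta[fail[s]][c]
    return delta


def occurrence_distribution(pattern, n, init, trans):
    """Return a list p with p[x] = probability that `pattern` occurs exactly
    x times (overlaps counted) in a length-n sequence, n >= 1.

    init[c]     : probability that the first character is c (assumed input)
    trans[c][d] : probability of character d following character c
    """
    alphabet = sorted(init)
    m = len(pattern)
    delta = build_automaton(pattern, alphabet)
    max_count = max(n - m + 1, 0)

    # dist maps (last character, automaton state, count) to probability
    dist = {}
    for c in alphabet:
        s = delta[0][c]
        key = (c, s, 1 if s == m else 0)
        dist[key] = dist.get(key, 0.0) + init[c]

    for _ in range(n - 1):
        nxt = {}
        for (c, s, x), p in dist.items():
            for d in alphabet:
                q = p * trans[c][d]
                if q == 0.0:
                    continue
                s2 = delta[s][d]
                key = (d, s2, x + (1 if s2 == m else 0))
                nxt[key] = nxt.get(key, 0.0) + q
        dist = nxt

    result = [0.0] * (max_count + 1)
    for (_, _, x), p in dist.items():
        result[x] += p
    return result


if __name__ == "__main__":
    # Illustrative two-letter chain; the numbers are made up, not from the paper.
    init = {"A": 1.0, "B": 0.0}
    trans = {"A": {"A": 0.5, "B": 0.5}, "B": {"A": 1.0, "B": 0.0}}
    print(occurrence_distribution("AB", 3, init, trans))
```

The probability that the pattern occurs at least once is then `1 - p[0]`. The state space grows with the pattern length, the alphabet size and the maximum count, which is acceptable for short nucleotide words but is a design choice of this sketch rather than a property claimed by the paper.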

Kleffe, J., & Langbecker, U. (1990). Exact computation of pattern probabilities in random sequences generated by Markov chains. Bioinformatics, 6(4), 347–353. https://doi.org/10.1093/bioinformatics/6.4.347
