An MCMC algorithm for detecting short adjacent repeats shared by multiple sequences

Abstract

Motivation: Repeat detection problems are traditionally formulated as string matching or signal processing problems. Methods based on these formulations cannot readily handle gaps between repeat units and cannot detect repeat patterns shared by multiple sequences. This study detects short adjacent repeats with inter-unit insertions in multiple sequences. For biological sequences, such studies can shed light on molecular structure, biological function and evolution.

Results: The task of detecting short adjacent repeats is formulated as a statistical inference problem using a probabilistic generative model. A Markov chain Monte Carlo algorithm is proposed to infer the parameters in a de novo fashion. Applications to synthetic and real biological data show that the new method not only has a competitive edge over existing methods, but can also provide a way to study the structure and evolution of repeat-containing genes.

© The Author 2011. Published by Oxford University Press. All rights reserved.

Cite

Li, Q., Fan, X., Liang, T., & Li, S. Y. R. (2011). An MCMC algorithm for detecting short adjacent repeats shared by multiple sequences. Bioinformatics, 27(13), 1772–1779. https://doi.org/10.1093/bioinformatics/btr287
