ToPS: A Framework to Manipulate Probabilistic Models of Sequence Data

5Citations
Citations of this article
42Readers
Mendeley users who have this article in their library.

Abstract

Discrete Markovian models can be used to characterize patterns in sequences of values and have many applications in biological sequence analysis, including gene prediction, CpG island detection, alignment, and protein profiling. We present ToPS, a computational framework that can be used to implement different applications in bioinformatics analysis by combining eight kinds of models: (i) independent and identically distributed process; (ii) variable-length Markov chain; (iii) inhomogeneous Markov chain; (iv) hidden Markov model; (v) profile hidden Markov model; (vi) pair hidden Markov model; (vii) generalized hidden Markov model; and (viii) similarity based sequence weighting. The framework includes functionality for training, simulation and decoding of the models. Additionally, it provides two methods to help parameter setting: Akaike and Bayesian information criteria (AIC and BIC). The models can be used stand-alone, combined in Bayesian classifiers, or included in more complex, multi-model, probabilistic architectures using GHMMs. In particular the framework provides a novel, flexible, implementation of decoding in GHMMs that detects when the architecture can be traversed efficiently. © 2013 Kashiwabara et al.

Cite

CITATION STYLE

APA

Kashiwabara, A. Y., Bonadio, Í., Onuchic, V., Amado, F., Mathias, R., & Durham, A. M. (2013). ToPS: A Framework to Manipulate Probabilistic Models of Sequence Data. PLoS Computational Biology, 9(10). https://doi.org/10.1371/journal.pcbi.1003234

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free