Dirichlet mixtures: A method for improved detection of weak but significant protein sequence homology

290Citations
Citations of this article
177Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present a method for condensing the information in multiple alignments of proteins into a mixture of Dirichlet densities over amino acid distributions. Dirichiet mixture densities are designed to be combined with observed amino acid frequencies to form estimates of expected amino acid probabilities at each position in a profile, hidden Markov model or other statistical model. These estimates give a statistical model greater generalization capacity, so that remotely related family members can be more reliably recognized by the model. This paper corrects the previously published formula for estimating these expected probabilities, and contains complete derivations of the Dirichiet mixture formulas, methods for optimizing the mixtures to match particular databases, and suggestions for efficient implementation. © 1996, Oxford University Press.

Cite

CITATION STYLE

APA

Sjolander, K., Karplus, K., Brown, M., Hughey, R., Krogh, A., Saira Mian, I., & Haussler, D. (1996). Dirichlet mixtures: A method for improved detection of weak but significant protein sequence homology. Bioinformatics, 12(4), 327–345. https://doi.org/10.1093/bioinformatics/12.4.327

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free