Robust remote homology detection by feature based profile hidden Markov models

  • Plötz T
  • Fink G
  • 6

    Readers

    Mendeley users who have this article in their library.
  • 5

    Citations

    Citations of this article.

Abstract

The detection of remote homologies is of major importance for molecular biology ap- plications like drug discovery. The problem is still very challenging even for state-of-the-art probabilistic models of protein families, namely Profile HMMs. In order to improve re- mote homology detection we propose feature based semi-continuous Profile HMMs. Based on a richer sequence representation consisting of features which capture the biochemical properties of residues in their local context, family specific semi-continuous models are estimated completely data-driven. Additionally, for substantially reducing the number of false predictions an explicit rejection model is estimated. Both the family specific semi- continuous Profile HMM and the non-target model are competitively evaluated. In the experimental evaluation of superfamily based screening of the SCOP database we demonstrate that semi-continuous Profile HMMs significantly outperform their discrete counterparts. Using the rejection model the number of false positive predictions could be reduced substantially which is an important prerequisite for target identification applica- tions.

Author-supplied keywords

  • Feature representation
  • Profile Hidden Markov Models (Profile HMMs)
  • Protein sequence analysis
  • Remote homology detection
  • Target identification

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document

Authors

  • Thomas Plötz

  • Gernot A. Fink

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free