Use of profile hidden Markov models in viral discovery: current insights

  • Reyes A
  • P. Alves J
  • Durham A
  • et al.
N/ACitations
Citations of this article
62Readers
Mendeley users who have this article in their library.

Abstract

Sequence similarity searches are the bioinformatic cornerstone of molecular sequence analysis for all domains of life. However, large amounts of divergence between organisms, such as those seen among viruses, can significantly hamper analyses. Profile hidden Markov models (profile HMMs) are among the most successful approaches for dealing with this problem, which represent an invaluable tool for viral identification efforts. Profile HMMs are statistical models that convert information from a multiple sequence alignment into a set of probability values that reflect position-specific variation levels in all members of evolutionarily related sequences. Since profile HMMs represent a wide spectrum of variation, these models show higher sensitivity than conventional similarity methods such as BLAST for the detection of remote homologs. In recent years, there has been an effort to compile viral sequences from different viral taxonomic groups into integrated databases , such as Prokaryotic Virus Orthlogous Groups (pVOGs) and database of profile HMMs (vFam) database, which provide functional annotation, multiple sequence alignments, and profile HMMs. Since these databases rely on viral sequences collected from GenBank and RefSeq, they suffer in variable extent from uneven taxonomic sampling, with low sequence representation of many viral groups, which affects the efficacy of the models. One of the interesting applications of viral profile HMMs is the detection and sequence reconstruction of specific viral genomes from metagenomic data. In fact, several DNA assembly programs that use profile HMMs as seeds have been developed to identify and build gene-sized assemblies or viral genome sequences of unrestrained length, using conventional and progressive assembly approaches, respectively. In this review, we address these aspects and cover some up-to-date information on viral genomics that should be considered in the choice of molecular markers for viral discovery. Finally, we propose a roadmap for rational development of viral profile HMMs and discuss the main challenges associated with this task. Keywords: profile hidden Markov models, viral discovery, DNA assembly, metagenomic analysis, molecular markers, de novo diagnosis Profile hidden Markov models: introductory concepts The goal of this section is to introduce the concepts necessary to understand the applicability of profile hidden Markov models (profile HMMs) for the process of viral discovery. For a more in-depth description on profile HMMs, the reader is referred to Durbin's excellent book. 1 Similarity searches and alignments A very important part of the current study of living organisms relies on sequencing their genome and transcriptome to unravel functional aspects of their biology. The

Cite

CITATION STYLE

APA

Reyes, A., P. Alves, J. M., Durham, A. M., & Gruber, A. (2017). Use of profile hidden Markov models in viral discovery: current insights. Advances in Genomics and Genetics, Volume 7, 29–45. https://doi.org/10.2147/agg.s136574

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free