Detecting overlapping coding sequences in virus genomes

71Citations
Citations of this article
67Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: Detecting new coding sequences (CDSs) in viral genomes can be difficult for several reasons. The typically compact genomes often contain a number of overlapping coding and non-coding functional elements, which can result in unusual patterns of codon usage; conservation between related sequences can be difficult to interpret - especially within overlapping genes; and viruses often employ non-canonical translational mechanisms - e.g. frameshifting, stop codon read-through, leaky-scanning and internal ribosome entry sites - which can conceal potentially coding open reading frames (ORFs). Results: In a previous paper we introduced a new statistic - MLOGD (Maximum Likelihood Overlapping Gene Detector) - for detecting and analysing overlapping CDSs. Here we present (a) an improved MLOGD statistic, (b) a greatly extended suite of software using MLOGD, (c) a database of results for 640 virus sequence alignments, and (d) a web-interface to the software and database. Tests show that, from an alignment with just 20 mutations, MLOGD can discriminate non-overlapping CDSs from non-coding ORFs with a typical accuracy of up to 98%, and can detect CDSs overlapping known CDSs with a typical accuracy of 90%. In addition, the software produces a variety of statistics and graphics, useful for analysing an input multiple sequence alignment. Conclusion: MLOGD is an easy-to-use tool for virus genome annotation, detecting new CDSs - in particular overlapping or short CDSs - and for analysing overlapping CDSs following frameshift sites. The software, web-server, database and supplementary material are available at http:// guinevere.otago.ac.nz/mlogd.html. © 2006 Firth and Brown, licensee BioMed Central Ltd.

Cite

CITATION STYLE

APA

Firth, A. E., & Brown, C. M. (2006). Detecting overlapping coding sequences in virus genomes. BMC Bioinformatics, 7. https://doi.org/10.1186/1471-2105-7-75

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free