A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads

Yuan Zhang; Yanni Sun; James R. Cole

Journal ArticleOPEN ACCESS

A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads

Bioinformatics (2013) 29(17) 2103-2111

DOI: 10.1093/bioinformatics/btt357

11Citations

47Readers

Abstract

Motivation: Protein domain classification is an important step in functional annotation for next-generation sequencing data. For RNA-Seq data of non-model organisms that lack quality or complete reference genomes, existing protein domain analysis pipelines are applied to short reads directly or to contigs that are generated using de novo sequence assembly tools. However, these strategies do not provide satisfactory performance in classifying short reads into their native domain families.Results: We introduce SALT, a protein domain classification tool based on profile hidden Markov models and graph algorithms. SALT carefully incorporates the characteristics of reads that are sequenced from the domain regions and assembles them into contigs based on a supervised graph construction algorithm. We applied SALT to two RNA-Seq datasets of different read lengths and quantified its performance using the available protein domain annotations and the reference genomes. Compared with existing strategies, SALT showed better sensitivity and accuracy. In the third experiment, we applied SALT to a non-model organism. The experimental results demonstrated that it identified more transcribed protein domain families than other tested classifiers. Availability: The source code and supplementary data are available at https://sourceforge.net/projects/salt1/Contact: Supplementary information: Supplementary data are available at Bioinformatics online. © 2013 The Author.

Cite

CITATION STYLE

APA

Zhang, Y., Sun, Y., & Cole, J. R. (2013). A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads. Bioinformatics, 29(17), 2103–2111. https://doi.org/10.1093/bioinformatics/btt357

A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads

Abstract

Cite

Register to see more suggestions