AC-DIAMOND: Accelerating protein alignment via better SIMD parallelization and space-efficient indexing

1Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

To speed up the alignment of DNA reads or assembled contigs against a protein database has been a challenge up to now. The recent tool DIAMOND has significantly improved the speed of BLASTX and RAPSearch, while giving similar degree of sensitivity. Yet for applications like metagenomics, where large amount of data is involved, DIAMOND still takes a lot of time. This paper introduces an even faster protein alignment tool, called AC-DIAMOND, which attempts to speed up DIAMOND via better SIMD parallelization and more space-efficient indexing of the reference database; the latter allows more queries to be loaded into the memory and processed together. Experimental results show that ACDIAMOND is about 4 times faster than DIAMOND on aligning DNA reads or contigs, while retaining the same sensitivity as DIAMOND.For example, the latest assembly of the Iowa praire soil metagenomic dataset generates over 9 milllion of contigs, with a total size about 7Gbp; when aligning these contigs to the protein database NCBI-nr, DIAMOND takes 4 to 5 days, and AC-DIAMOND takes about 1 day. AC-DIAMOND is available for testing at http://ac-diamond.sourceforge.net.

Cite

CITATION STYLE

APA

Mai, H., Li, D., Zhang, Y., Leung, H. C. M., Luo, R., Ting, H. F., & Lam, T. W. (2016). AC-DIAMOND: Accelerating protein alignment via better SIMD parallelization and space-efficient indexing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9656, pp. 426–433). Springer Verlag. https://doi.org/10.1007/978-3-319-31744-1_38

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free