MBBC: An efficient approach for metagenomic binning based on clustering

22Citations
Citations of this article
95Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: Binning environmental shotgun reads is one of the most fundamental tasks in metagenomic studies, in which mixed reads from different species or operational taxonomical units (OTUs) are separated into different groups. While dozens of binning methods are available, there is still room for improvement. Results: We developed a novel taxonomy-independent approach called MBBC (etagenomic inning ased on lustering) to cluster environmental shotgun reads, by considering k-mer frequency in reads and Markov properties of the inferred OTUs. Tested on twelve simulated datasets, MBBC reliably estimated the species number, the genome size, and the relative abundance of each species, independent of whether there are errors in reads. Tested on multiple experimental datasets, MBBC outperformed two state-of-the-art taxonomy-independent methods, in terms of the accuracy of the estimated species number, genome sizes, and percentages of correctly assigned reads, among other metrics. Conclusions: We have developed a novel method for binning metagenomic reads based on clustering. This method is demonstrated to reliably predict species numbers, genome sizes, relative species abundances, and k-mer coverage in simple datasets. Our method also has a high accuracy in read binning. The MBBC software is freely available at http://eecs.ucf.edu/~xiaoman/MBBC/MBBC.html.

Cite

CITATION STYLE

APA

Wang, Y., Hu, H., & Li, X. (2015). MBBC: An efficient approach for metagenomic binning based on clustering. BMC Bioinformatics, 16(1). https://doi.org/10.1186/s12859-015-0473-8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free