Alignment-free visualization of metagenomic data by nonlinear dimension reduction

43Citations
Citations of this article
241Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The visualization of metagenomic data, especially without prior taxonomic identification of reconstructed genomic fragments, is a challenging problem in computational biology. An ideal visualization method should, among others, enable clear distinction of congruent groups of sequences of closely related taxa, be applicable to fragments of lengths typically achievable following assembly, and allow the efficient analysis of the growing amounts of community genomic sequence data. Here, we report a scalable approach for the visualization of metagenomic data that is based on nonlinear dimension reduction via Barnes-Hut Stochastic Neighbor Embedding of centered log-ratio transformed oligonucleotide signatures extracted from assembled genomic sequence fragments. The approach allows for alignment-free assessment of the data-inherent taxonomic structure, and it can potentially facilitate the downstream binning of genomic fragments into uniform clusters reflecting organismal origin. We demonstrate the performance of our approach by visualizing community genomic sequence data from simulated as well as groundwater, human-derived and marine microbial communities.

Cite

CITATION STYLE

APA

Laczny, C. C., Pinel, N., Vlassis, N., & Wilmes, P. (2014). Alignment-free visualization of metagenomic data by nonlinear dimension reduction. Scientific Reports, 4. https://doi.org/10.1038/srep04516

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free