FMLC: Fast multi-level clustering and visualization of large molecular datasets

12Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation Despite successful applications of data clustering and visualization techniques in molecular sequence identification, current technologies still do not scale to large biological datasets. Results We address this problem by a new multi-threaded tool, fMLC, primarily developed to cluster DNA sequences, that is supplemented with an interactive web-based visualization component, DiVE. fMLC enabled to compare, cluster and visualize 350K ITS fungal sequences at the species level. It took less than two hours to compare and cluster the dataset, which is twelve times faster than the time reported previously. Availability and implementation https://github.com/FastMLC/fMLC (doi: 10.5281/zenodo.926820) Contact d.vu@westerdijkinstitute.nl or v.robert@westerdijkinstitute.nl.

Cite

CITATION STYLE

APA

Vu, D., Georgievska, S., Szoke, S., Kuzniar, A., & Robert, V. (2018). FMLC: Fast multi-level clustering and visualization of large molecular datasets. Bioinformatics, 34(9), 1577–1579. https://doi.org/10.1093/bioinformatics/btx810

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free