Next-generation sequencing revolution through big data analytics

Rashmi Tripathi; Pawan Sharma; Pavan Chakraborty; Pritish Kumar Varadwaj

Journal Article, Open Access

Frontiers in Life Science (2016) 9(2) 119-149

DOI: 10.1080/21553769.2016.1178180

44 Citations

152 Readers

Abstract

Next-generation sequencing (NGS) technology has led to an unrivaled explosion in the amount of genomic data, and this growth has in turn raised the challenges of sharing, archiving, integrating and analyzing these data. The scale and throughput of NGS make it difficult to analyze the resulting genomic data, gene interactions, annotations and expression studies. However, this limitation can be overcome by tools and algorithms built on a big data framework. Here we review the current state of knowledge of big data algorithms for NGS that reveal hidden patterns in sequencing, analysis, annotation and related tasks. The Apache Hadoop framework provides an on-demand, adaptable environment for large-scale data analysis. It has several components for partitioning large-scale data across clusters of commodity hardware in a fault-tolerant manner. Packages such as MapReduce, CloudBurst, Crossbow, Myrna, Eoulsan, DistMap, Seal and Contrail support various NGS applications, such as adapter trimming, quality checking, read mapping, de novo assembly, quantification, expression analysis, variant analysis and annotation. This review covers current applications of Hadoop technology, with their usage and limitations, from the perspective of NGS.

Cite

Tripathi, R., Sharma, P., Chakraborty, P., & Varadwaj, P. K. (2016). Next-generation sequencing revolution through big data analytics. Frontiers in Life Science, 9(2), 119–149. https://doi.org/10.1080/21553769.2016.1178180
