Apache Spark Implementations for String Patterns in DNA Sequences

Andreas Kanavos; Ioannis Livieris; Phivos Mylonas; Spyros Sioutas; Gerasimos Vonitsanos

Book Chapter

Apache Spark Implementations for String Patterns in DNA Sequences

Springer, (2020), 439-453

DOI: 10.1007/978-3-030-32622-7_42

1Citations

2Readers

Get full text

Abstract

The availability of numerical data grows from 1 day to another in a remarkable way. New technologies of high-throughput Next-Generation Sequencing (NGS) are producing DNA sequences. Next-Generation Sequencing describes a DNA sequencing technology which has revolutionized genomic research. In this paper, we perform some experiments using a cloud infrastructure framework, namely, Apache Spark, in some sequences derived from the National Center for Biotechnology Information (NCBI). The problems we examine are some of the most popular ones, namely, Longest Common Prefix, Longest Common Substring, and Longest Common Subsequence.

Author supplied keywords

Cite

CITATION STYLE

APA

Kanavos, A., Livieris, I., Mylonas, P., Sioutas, S., & Vonitsanos, G. (2020). Apache Spark Implementations for String Patterns in DNA Sequences. In Advances in Experimental Medicine and Biology (Vol. 1194, pp. 439–453). Springer. https://doi.org/10.1007/978-3-030-32622-7_42

Apache Spark Implementations for String Patterns in DNA Sequences

Abstract

Author supplied keywords

Cite

Register to see more suggestions