Optimal gap-affine alignment in O(s) space

Santiago Marco-Sola; Jordan M. Eizenga; Andrea Guarracino; Benedict Paten; Erik Garrison; Miquel Moreto

Journal ArticleOPEN ACCESS

Optimal gap-affine alignment in O(s) space

Bioinformatics (2023) 39(2)

DOI: 10.1093/bioinformatics/btad074

44Citations

28Readers

Abstract

Motivation: Pairwise sequence alignment remains a fundamental problem in computational biology and bioinformatics. Recent advances in genomics and sequencing technologies demand faster and scalable algorithms that can cope with the ever-increasing sequence lengths. Classical pairwise alignment algorithms based on dynamic programming are strongly limited by quadratic requirements in time and memory. The recently proposed wavefront alignment algorithm (WFA) introduced an efficient algorithm to perform exact gap-affine alignment in OðnsÞ time, where s is the optimal score and n is the sequence length. Notwithstanding these bounds, WFA’s Oðs2Þ memory requirements become computationally impractical for genome-scale alignments, leading to a need for further improvement. Results: In this article, we present the bidirectional WFA algorithm, the first gap-affine algorithm capable of computing optimal alignments in OðsÞ memory while retaining WFA’s time complexity of OðnsÞ. As a result, this work improves the lowest known memory bound OðnÞ to compute gap-affine alignments. In practice, our implementation never requires more than a few hundred MBs aligning noisy Oxford Nanopore Technologies reads up to 1 Mbp long while maintaining competitive execution times.

Cite

CITATION STYLE

APA

Marco-Sola, S., Eizenga, J. M., Guarracino, A., Paten, B., Garrison, E., & Moreto, M. (2023). Optimal gap-affine alignment in O(s) space. Bioinformatics, 39(2). https://doi.org/10.1093/bioinformatics/btad074

Optimal gap-affine alignment in O(s) space

Abstract

Cite

Register to see more suggestions