mapAlign: An Efficient Approach for Mapping and Aligning Long Reads to Reference Genomes

Wen Yang; Lusheng Wang

Conference Proceedings

mapAlign: An Efficient Approach for Mapping and Aligning Long Reads to Reference Genomes

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12304 LNBI 105-118

DOI: 10.1007/978-3-030-57821-3_10

0Citations

1Readers

Get full text

Abstract

Long reads play an important role for the identification of structural variants, sequencing repetitive regions, phasing of alleles, etc. In this paper, we propose a new approach for mapping long reads to reference genomes. We also propose a new method to generate accurate alignments of the long reads and the corresponding segments of reference genome. The new mapping algorithm is based on the longest common sub-sequence with distance constraints. The new (local) alignment algorithms is based on the idea of recursive alignment of variable size k-mers. Experiments show that our new method can generate better alignments in terms of both identity and alignment scores for both Nanopore and SMRT data sets. In particular, our method can align 91.53% and 85.36% of letters on reads to identical letters on reference genomes for human individuals of Nanopore and SMRT data sets, respectively. The state-of-the-art method can only align 88.44% and 79.08% letters of reads for Nanopore and SMRT data sets, respectively. Our method is also faster than the state-of-the-art method. Availability: https://github.com/yw575/mapAlign

Author supplied keywords

Cite

CITATION STYLE

APA

Yang, W., & Wang, L. (2020). mapAlign: An Efficient Approach for Mapping and Aligning Long Reads to Reference Genomes. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12304 LNBI, pp. 105–118). Springer. https://doi.org/10.1007/978-3-030-57821-3_10

mapAlign: An Efficient Approach for Mapping and Aligning Long Reads to Reference Genomes

Abstract

Author supplied keywords

Cite

Register to see more suggestions