Dynamic Pooling Improves Nanopore Base Calling Accuracy

Vladimir Boza; Peter Peresini; Brona Brejova; Tomas Vinar

Journal Article, Open Access

IEEE/ACM Transactions on Computational Biology and Bioinformatics (2022) 19(6) 3416-3424

DOI: 10.1109/TCBB.2021.3128366

Abstract

In nanopore sequencing, electrical signal is measured as DNA molecules pass through the sequencing pores. Translating these signals into DNA bases (base calling) is a highly non-trivial task, and its quality has a large impact on the sequencing accuracy. The most successful nanopore base callers to date use convolutional neural networks (CNN) to accomplish the task. Convolutional layers in CNNs are typically composed of filters with constant window size, performing best in analysis of signals with uniform speed. However, the speed of nanopore sequencing varies greatly both within reads and between sequencing runs. Here, we present dynamic pooling, a novel neural network component, which addresses this problem by adaptively adjusting the pooling ratio. To demonstrate the usefulness of dynamic pooling, we developed two base callers: Heron and Osprey. Heron improves the accuracy beyond the experimental high-accuracy base caller Bonito developed by Oxford Nanopore. Osprey is a fast base caller that can compete in accuracy with Guppy high-accuracy mode, but does not require GPU acceleration and achieves a near real-time speed on common desktop CPUs. Availability: https://github.com/fmfi-compbio/osprey, https://github.com/fmfi-compbio/heron.
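To make the idea of an adaptively adjusted pooling ratio concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes each input time step carries a rate in (0, 1] that says how far the output position advances at that step; in a real base caller these rates would presumably be predicted by the network, whereas here they are supplied by hand. The function name `dynamic_pool` and the linear split between neighbouring output slots are assumptions made for this example only.

```python
import numpy as np


def dynamic_pool(features, rates):
    """Pool a (T, C) feature sequence using a per-step rate (hypothetical sketch).

    Step i is placed at the output position equal to the sum of all earlier
    rates. Its features are split linearly between the two nearest integer
    output slots, so small rates compress the signal strongly and rates near 1
    leave it almost unpooled.
    """
    features = np.asarray(features, dtype=float)
    rates = np.asarray(rates, dtype=float)

    # Fractional output position of every input step.
    positions = np.cumsum(rates) - rates

    # One extra slot so the "upper" neighbour of the last step always exists.
    out_len = int(np.floor(positions[-1])) + 2
    pooled = np.zeros((out_len, features.shape[1]))

    low = np.floor(positions).astype(int)
    frac = positions - low

    # Unbuffered accumulation: several input steps may land in the same slot.
    np.add.at(pooled, low, features * (1.0 - frac)[:, None])
    np.add.at(pooled, low + 1, features * frac[:, None])
    return pooled


if __name__ == "__main__":
    signal = np.ones((6, 1))
    # A constant rate of 0.5 behaves like pooling with ratio 2.
    print(dynamic_pool(signal, np.full(6, 0.5)))
    # Slower passage through the pore (more samples per base) could be
    # modelled with smaller rates in that region of the signal.
    print(dynamic_pool(signal, np.array([1.0, 1.0, 0.25, 0.25, 0.25, 0.25])))
```

In this sketch the total feature mass is preserved, and the output length depends on the rates rather than on a fixed window size, which is the property the abstract contrasts with constant-window convolutional filters.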

Cite

Boza, V., Peresini, P., Brejova, B., & Vinar, T. (2022). Dynamic Pooling Improves Nanopore Base Calling Accuracy. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 19(6), 3416–3424. https://doi.org/10.1109/TCBB.2021.3128366