Analysis of nanopore data using hidden Markov models

Jacob Schreiber; Kevin Karplus

Journal ArticleOPEN ACCESS

Analysis of nanopore data using hidden Markov models

Bioinformatics (2015) 31(12) 1897-1903

DOI: 10.1093/bioinformatics/btv046

27Citations

97Readers

Abstract

Motivation: Nanopore-based sequencing techniques can reconstruct properties of biosequences by analyzing the sequence-dependent ionic current steps produced as biomolecules pass through a pore. Typically this involves alignment of new data to a reference, where both reference construction and alignment have been performed by hand. Results: We propose an automated method for aligning nanopore data to a reference through the use of hidden Markov models. Several features that arise from prior processing steps and from the class of enzyme used can be simply incorporated into the model. Previously, the M2MspA nanopore was shown to be sensitive enough to distinguish between cytosine, methylcytosine and hydroxymethylcytosine. We validated our automated methodology on a subset of that data by automatically calculating an error rate for the distinction between the three cytosine variants and show that the automated methodology produces a 2-3% error rate, lower than the 10% error rate from previous manual segmentation and alignment.

Cite

CITATION STYLE

APA

Schreiber, J., & Karplus, K. (2015). Analysis of nanopore data using hidden Markov models. Bioinformatics, 31(12), 1897–1903. https://doi.org/10.1093/bioinformatics/btv046

Analysis of nanopore data using hidden Markov models

Abstract

Cite

Register to see more suggestions