Trainable high resolution melt curve machine learning classifier for large-scale reliable genotyping of sequence variants

Pornpat Athamanolap; Vishwa Parekh; Stephanie I. Fraley; Vatsal Agarwal; Dong J. Shin; Michael A. Jacobs; Tza Huei Wang; Samuel Yang

Journal Article, Open Access

PLoS ONE (2014) 9(10)

DOI: 10.1371/journal.pone.0109094

Abstract

High resolution melt (HRM) is gaining considerable popularity as a simple and robust method for genotyping sequence variants. However, accurately genotyping an unknown sample that may match any of a large number of possible variants requires an automated HRM curve identification method that can compare the unknown against a large cohort of known sequence variants. Herein, we describe a new method for automated HRM curve classification that is based on machine learning and learns a tolerance for deviations in reaction conditions. We tested this method in silico through multiple cross-validations, using curves generated under 9 different simulated experimental conditions to classify 92 known serotypes of Streptococcus pneumoniae, and demonstrated over 99% accuracy with 8 training curves per serotype. We then verified the algorithm in vitro using sequence variants of a cancer-related gene and demonstrated 100% accuracy with 3 training curves per sequence variant. The machine learning algorithm enabled reliable, scalable, and automated HRM genotyping analysis with broad potential clinical and epidemiological applications.
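The general idea of matching an unknown melt curve against a library of training curves can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the classifier, so the sketch assumes a simple 1-nearest-neighbor rule on Euclidean distance, an idealized sigmoid melt curve, and small random melting-temperature shifts standing in for reaction condition deviations. All names (`melt_curve`, `simulate_training_set`, `classify`) and all numeric values are hypothetical.

```python
import math
import random


def melt_curve(tm, width=1.0):
    """Idealized melt curve (assumption): fraction of double-stranded DNA
    sampled every 0.25 degrees C from 70 to 90, modeled as a sigmoid
    centered on the melting temperature tm."""
    temps = [70 + 0.25 * i for i in range(81)]
    return [1.0 / (1.0 + math.exp((t - tm) / width)) for t in temps]


def simulate_training_set(variant_tms, curves_per_variant, shift_sd, rng):
    """Generate labeled training curves. Each curve's melting temperature
    is perturbed by Gaussian noise to mimic run-to-run deviations."""
    training = []
    for label, tm in variant_tms.items():
        for _ in range(curves_per_variant):
            shifted_tm = tm + rng.gauss(0.0, shift_sd)
            training.append((label, melt_curve(shifted_tm)))
    return training


def classify(curve, training):
    """Return the label of the training curve closest to `curve`
    (1-nearest-neighbor on squared Euclidean distance)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    label, _ = min(training, key=lambda item: sq_dist(curve, item[1]))
    return label


# Hypothetical variants with made-up melting temperatures.
rng = random.Random(0)
variant_tms = {"variant_A": 78.0, "variant_B": 80.0, "variant_C": 82.0}
training = simulate_training_set(variant_tms, curves_per_variant=3,
                                 shift_sd=0.1, rng=rng)

unknown = melt_curve(80.1)  # an "unknown" sample close to variant_B
predicted = classify(unknown, training)
print(predicted)
```

With melting temperatures this far apart relative to the simulated shifts, the unknown curve should be assigned to `variant_B`. Real HRM curves differ in shape as well as position, which is one reason a trained classifier with learned tolerance, as described in the abstract, is needed rather than a fixed melting-temperature threshold.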

Athamanolap, P., Parekh, V., Fraley, S. I., Agarwal, V., Shin, D. J., Jacobs, M. A., … Yang, S. (2014). Trainable high resolution melt curve machine learning classifier for large-scale reliable genotyping of sequence variants. PLoS ONE, 9(10). https://doi.org/10.1371/journal.pone.0109094
