Genome-wide mutation scoring for machine-learning-based antimicrobial resistance prediction

Peter Májek; Lukas Lüftinger; Stephan Beisken; Thomas Rattei; Arne Materna

Journal article, open access

International Journal of Molecular Sciences (2021) 22(23)

DOI: 10.3390/ijms222313049

Abstract

The prediction of antimicrobial resistance (AMR) based on genomic information can improve patient outcomes. Genetic mechanisms have been shown to explain AMR with accuracies in line with standard microbiology laboratory testing. To translate genetic mechanisms into phenotypic AMR, machine learning has been successfully applied. AMR machine learning models typically use nucleotide k-mer counts to represent genomic sequences. While k-mer representation efficiently captures sequence variation, it also results in high-dimensional and sparse data. With limited training data available, achieving acceptable model performance or model interpretability is challenging. In this study, we explore the utility of feature engineering with several biologically relevant signals. We propose to predict the functional impact of observed mutations with PROVEAN to use the predicted impact as a new feature for each protein in an organism’s proteome. The addition of the new features was tested on a total of 19,521 isolates across nine clinically relevant pathogens and 30 different antibiotics. The new features significantly improved the predictive performance of trained AMR models for Pseudomonas aeruginosa, Citrobacter freundii, and Escherichia coli. The balanced accuracy of the respective models of those three pathogens improved by 6.0% on average.
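The k-mer representation and the balanced accuracy metric mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names `kmer_counts`, `kmer_vector`, and `balanced_accuracy` are hypothetical, the alphabet is assumed to be A, C, G, T, and labels are assumed to be binary (1 = resistant, 0 = susceptible). The PROVEAN scoring step is not shown because the abstract does not specify how scores are aggregated per protein.

```python
from collections import Counter
from itertools import product


def kmer_counts(sequence, k):
    """Count overlapping nucleotide k-mers in a sequence."""
    sequence = sequence.upper()
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def kmer_vector(sequence, k):
    """Return a dense count vector over all 4**k possible k-mers.

    The vector length grows exponentially with k, and most entries are
    zero for any single genome, which is why this representation is
    high-dimensional and sparse.
    """
    counts = kmer_counts(sequence, k)
    vocabulary = ["".join(p) for p in product("ACGT", repeat=k)]
    return [counts.get(kmer, 0) for kmer in vocabulary]


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels.

    Assumes both classes occur at least once in y_true.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    positives = sum(1 for t in y_true if t == 1)
    negatives = len(y_true) - positives
    return 0.5 * (tp / positives + tn / negatives)
```

For a toy sequence such as `"ACGTAC"` with `k = 3`, the vector has 64 entries, of which only four are non-zero. This shows on a small scale why feature counts quickly outgrow the number of available training isolates.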

Cite

Májek, P., Lüftinger, L., Beisken, S., Rattei, T., & Materna, A. (2021). Genome-wide mutation scoring for machine-learning-based antimicrobial resistance prediction. International Journal of Molecular Sciences, 22(23). https://doi.org/10.3390/ijms222313049
