Abstract
Background: Hypervirulent Klebsiella pneumoniae (hvKP) infections can have high morbidity and mortality rates owing to their invasiveness and virulence. However, there are no effective tools or biomarkers to discriminate between hvKP and nonhypervirulent K. pneumoniae (nhvKP) strains. We aimed to use a random forest algorithm to predict hvKP based on core-genome data. Methods: In total, 272 K. pneumoniae strains were collected from 20 tertiary hospitals in China and divided into hvKP and nhvKP groups according to clinical criteria. Clinical data comparisons, whole-genome sequencing, virulence profile analysis, and core genome multilocus sequence typing (cgMLST) were performed. We then established a random forest predictive model based on the cgMLST scheme to prospectively identify hvKP. The random forest is an ensemble learning method that generates multiple decision trees during the training process and each decision tree will output its own prediction results corresponding to the input. The predictive ability of the model was assessed by means of area under the receiver operating characteristic curve. Results: Patients in the hvKP group were younger than those in the nhvKP group (median age, 58.0 and 68.0 years, respectively; P
Author supplied keywords
Cite
CITATION STYLE
Lan, P., Shi, Q., Zhang, P., Chen, Y., Yan, R., Hua, X., … Yu, Y. (2020). Core genome allelic profiles of clinical klebsiella pneumoniae strains using a random forest algorithm based on multilocus sequence typing scheme for hypervirulence analysis. Journal of Infectious Diseases, 221, S263–S271. https://doi.org/10.1093/INFDIS/JIZ562
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.