A machine learning approach to identification of unhealthy drinking

17Citations
Citations of this article
45Readers
Mendeley users who have this article in their library.

Abstract

Introduction: Unhealthy drinking is prevalent in the United States, and yet it is underidentified and undertreated. Identifying unhealthy drinkers can be time-consuming and uncomfortable for primary care providers. An automated rule for identification would focus attention on patients most likely to need care and, therefore, increase efficiency and effectiveness. The objective of this study was to build a clinical prediction tool for unhealthy drinking based on routinely available demographic and laboratory data. Methods: We obtained 38 demographic and laboratory variables from the National Health and Nutrition Examination Survey (1999 to 2016) on 43,545 nationally representative adults who had information on alcohol use available as a reference standard. Logistic regression, support vector machines, k-nearest neighbor, neural networks, decision trees, and random forests were used to build clinical prediction models. The model with the largest area under the receiver operator curve was selected to build the prediction tool. Results: A random forest model with 15 variables produced the largest area under the receiver operator curve (0.78) in the test set. The most influential predictors were age, current smoker, hemoglobin, sex, and high-density lipoprotein. The optimum operating point had a sensitivity of 0.50, specificity of 0.86, positive predictive value of 0.55, and negative predictive value of 0.83. Application of the tool resulted in a much smaller target sample (75% reduced). Conclusion: Using commonly available data, a decision tool can identify a subset of patients who seem to warrant clinical attention for unhealthy drinking, potentially increasing the efficiency and reach of screening.

References Powered by Scopus

Random forests

106516Citations
N/AReaders
Get full text

The meaning and use of the area under a receiver operating characteristic (ROC) curve

18445Citations
N/AReaders
Get full text

Induction of Decision Trees

16555Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Predicting the Risk of Alcohol Use Disorder Using Machine Learning: A Systematic Literature Review

15Citations
N/AReaders
Get full text

Patterns of high-risk drinking among medical students: A web-based survey with machine learning

11Citations
N/AReaders
Get full text

Binge drinking in early adulthood: A machine learning approach

10Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Bonnell, L. N., Littenberg, B., Wshah, S. R., & Rose, G. L. (2020). A machine learning approach to identification of unhealthy drinking. Journal of the American Board of Family Medicine, 33(3), 397–406. https://doi.org/10.3122/jabfm.2020.03.190421

Readers over time

‘20‘21‘22‘23‘24‘2505101520

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 10

53%

Lecturer / Post doc 4

21%

Professor / Associate Prof. 3

16%

Researcher 2

11%

Readers' Discipline

Tooltip

Computer Science 5

36%

Medicine and Dentistry 4

29%

Pharmacology, Toxicology and Pharmaceut... 3

21%

Business, Management and Accounting 2

14%

Article Metrics

Tooltip
Mentions
Blog Mentions: 2

Save time finding and organizing research with Mendeley

Sign up for free
0