Finding falls in ambulatory care clinical documents using statistical text mining

26Citations
Citations of this article
90Readers
Mendeley users who have this article in their library.

Abstract

Objective: To determine how well statistical text mining (STM) models can identify falls within clinical text associated with an ambulatory encounter. Materials and Methods: 2241 patients were selected with a fall-related ICD-9-CM E-code or matched injury diagnosis code while being treated as an outpatient at one of four sites within the Veterans Health Administration. All clinical documents within a 48-h window of the recorded E-code or injury diagnosis code for each patient were obtained (n=26 010; 611 distinct document titles) and annotated for falls. Logistic regression, support vector machine, and cost-sensitive support vector machine (SVM-cost) models were trained on a stratified sample of 70% of documents from one location (dataset Atrain) and then applied to the remaining unseen documents (datasets Atest-D). Results: All three STM models obtained area under the receiver operating characteristic curve (AUC) scores above 0.950 on the four test datasets (Atest-D). The SVM-cost model obtained the highest AUC scores, ranging from 0.953 to 0.978. The SVM-cost model also achieved F-measure values ranging from 0.745 to 0.853, sensitivity from 0.890 to 0.931, and specificity from 0.877 to 0.944. Discussion: The STM models performed well across a large heterogeneous collection of document titles. In addition, the models also generalized across other sites, including a traditionally bilingual site that had distinctly different grammatical patterns. Conclusions: The results of this study suggest STM-based models have the potential to improve surveillance of falls. Furthermore, the encouraging evidence shown here that STM is a robust technique for mining clinical documents bodes well for other surveillance-related topics.

References Powered by Scopus

Support-Vector Networks

45792Citations
N/AReaders
Get full text

A Coefficient of Agreement for Nominal Scales

31721Citations
N/AReaders
Get full text

Bagging predictors

19043Citations
N/AReaders
Get full text

Cited by Powered by Scopus

The Secondary Use of Electronic Health Records for Data Mining: Data Characteristics and Challenges

60Citations
N/AReaders
Get full text

A novel method of adverse event detection can accurately identify venous thromboembolisms (VTEs) from narrative electronic health record data

55Citations
N/AReaders
Get full text

Machine learning approaches to analysing textual injury surveillance data: A systematic review

55Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

McCart, J. A., Berndt, D. J., Jarman, J., Finch, D. K., & Luther, S. L. (2013). Finding falls in ambulatory care clinical documents using statistical text mining. Journal of the American Medical Informatics Association, 20(5), 906–914. https://doi.org/10.1136/amiajnl-2012-001334

Readers over time

‘12‘13‘14‘15‘16‘17‘18‘19‘20‘21‘22‘23‘2405101520

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 30

60%

Researcher 12

24%

Professor / Associate Prof. 6

12%

Lecturer / Post doc 2

4%

Readers' Discipline

Tooltip

Medicine and Dentistry 27

57%

Computer Science 10

21%

Engineering 6

13%

Nursing and Health Professions 4

9%

Save time finding and organizing research with Mendeley

Sign up for free
0