Automatic ICD-10 coding and training system: Deep neural network based on supervised learning

Abstract

Background: The International Classification of Diseases (ICD) code is widely used as the reference in medical systems and for billing purposes. However, classifying diseases into ICD codes still relies mainly on humans reading large amounts of written material as the basis for coding. Coding is both laborious and time-consuming. Since the conversion from ICD-9 to ICD-10, the coding task has become much more complicated, and approaches based on deep learning and natural language processing have been studied to assist disease coders.

Objective: This paper aims to construct a deep learning model for ICD-10 coding that automatically determines the corresponding diagnosis and procedure codes based solely on free-text medical notes, in order to improve accuracy and reduce human effort.

Methods: We used diagnosis records from National Taiwan University Hospital as resources and applied natural language processing techniques, including global vectors (GloVe), word to vectors (word2vec), embeddings from language models (ELMo), bidirectional encoder representations from transformers (BERT), and single head attention recurrent neural network, to the deep neural network architecture to implement ICD-10 auto-coding. In addition, we introduced an attention mechanism into the classification model to extract keywords from diagnoses and to visualize the coding reference for training novice coders in ICD-10. Sixty discharge notes were randomly selected to examine the change in F1-score and coding time for coders before and after using our model.

Results: In experiments on the medical data set of National Taiwan University Hospital, our prediction results revealed F1-scores of 0.715 and 0.618 for the ICD-10 Clinical Modification code and Procedure Coding System code, respectively, with a BERT embedding approach in the gated recurrent unit classification model. The well-trained models were deployed in an ICD-10 web service used for coding and for training ICD-10 users. With this service, coders' F1-score increased significantly from a median of 0.832 to 0.922 (P
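The F1-scores reported in the abstract compare predicted code sets against coder-assigned code sets. The abstract does not state which averaging scheme was used, so the following is only a minimal sketch that assumes micro-averaging over all notes. The function name `micro_f1` and the example ICD-10 codes are illustrative and do not come from the paper.

```python
from typing import List, Set


def micro_f1(gold: List[Set[str]], pred: List[Set[str]]) -> float:
    """Micro-averaged F1 over per-note sets of codes.

    Assumption: micro-averaging; the paper's exact scheme is not
    given in the abstract.
    """
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        tp += len(g & p)   # codes predicted and actually assigned
        fp += len(p - g)   # codes predicted but not assigned
        fn += len(g - p)   # codes assigned but missed by the model
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Illustrative data: two discharge notes with hypothetical code sets.
gold_codes = [{"I10", "E11.9"}, {"J18.9"}]
pred_codes = [{"I10"}, {"J18.9", "R05"}]
score = micro_f1(gold_codes, pred_codes)
```

Here two of three predicted codes are correct and two of three assigned codes are recovered, so precision and recall are both 2/3 and the score is 2/3.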

Cite

Chen, P. F., Wang, S. M., Liao, W. C., Kuo, L. C., Chen, K. C., Lin, Y. C., … Lai, F. (2021). Automatic ICD-10 coding and training system: Deep neural network based on supervised learning. JMIR Medical Informatics, 9(8). https://doi.org/10.2196/23230
