Differentiating Between COVID-19 and Tuberculosis Using Machine Learning and Natural Language Processing

2Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Over 10 million people around the world are affected by tuberculosis (TB) every year, making it a major global health concern. With the advent of the COVID-19 pandemic, TB services in many countries have been temporarily disrupted, leading to a potential delay in the diagnosis of TB cases and many cases going under the radar. Since both diseases sometimes present similarly and generally affect the lungs, there is also a risk of misdiagnosis. This study aims to analyse the differences between COVID-19 and TB in different patients, as a first step in the creation of a TB screening tool. 180 COVID-19 and 215 TB case reports were collected from ScienceDirect. Using Natural Language Processing tools, the patient's age, gender, and symptoms were extracted from each report. Tree-based machine learning algorithms were then used to classify each case report as belonging to either disease. Overall, the cases included 252 male and 117 female patients, with 26 cases not reporting the patient's sex. The patients' ages ranged from 0 to 95 years old, with a median age of 41.5. There were 33 cases with missing age values. The most frequent symptom in the TB cases was weight loss while most COVID-19 cases listed fever as a symptom. Of all algorithms implemented, XGBoost performed best in terms of ROC AUC (86.9 %) and F1-score macro (78%). The trained model is a good starting point, which can be used by medical staff to aid in referring potential TB patients in a timely manner. This could reduce the delay in TB diagnosis as well as the TB death toll, especially in highly infected countries.

Cite

CITATION STYLE

APA

Pholo, M. D., Hamam, Y., Khalaf, A. B., & Du, C. (2022). Differentiating Between COVID-19 and Tuberculosis Using Machine Learning and Natural Language Processing. Revue d’Intelligence Artificielle, 36(2), 313–318. https://doi.org/10.18280/ria.360216

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free