Abstract
Lung cancer is a malignant disease that im-poses serious complications restricting patients from performing daily tasks in the early stages and eventu-ally cause their death. The prevalence of this disease has been highlighted by numerous statistics worldwide. The preemptive diagnosis of individuals with lung can-cer can enhance chances of prevention and treatment. Therefore, the purpose of this study is to predict lung cancer preemptively utilizing simple clinical and demo-graphical features obtained from the 'data world' website. The experiment was conducted using Support Vector Machine (SVM), K-Nearest Neighbor (K-NN), and Logistic Regression (LR) classifiers. To improve models' accuracy, SMOTETomek was employed along with GridsearchCV to tune hyperparameters. The Re-cursive Feature Elimination method was also utilized to find the best feature subset. Results indicated that SVM achieved the best performance with 98.33% recall, 96.72% precision, and an accuracy of 97.27% using 15 attributes.
Author supplied keywords
Cite
CITATION STYLE
Olatunji, S. O., Alansari, A., Alkhorasani, H., Alsubaii, M., Sakloua, R., Alzahrani, R., … Ahmed, M. I. B. (2022). Machine Learning Based Preemptive Diagnosis of Lung Cancer Using Clinical Data. In Proceedings - 2022 7th International Conference on Data Science and Machine Learning Applications, CDMA 2022 (pp. 115–120). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CDMA54072.2022.00024
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.