Data mining is often called knowledge Discovery in Database (KDD). Data mining is usually used to improve future decision making based on information obtained from the past. For example for prediction, estimation, association, clustering, and description. Stroke is the second most deadly disease in the world according to WHO. The sufferer has an injury to the nervous system. Because of this, health experts, especially in the field of nursing, need special attention. Currently, the development of the Industrial Revolution Era 4.0 is collaborating in the fieldsof technology and health science so that it becomes something useful by using Machine Learning. There are so many benefits that are used in predicting several diseases that can be anticipated. In this study the dataset is dividedinto 2 parts, namely training data and testing data using split validation. Based on the results of the test that have been carried out in this study, the algorithm that has the highest accuracyvalue on balanced data is Logistic Regression with an accuracy rate of 75.65%, while for unbalanced data, the algorithm that has the highest accuracy results is Logistic Regression, Random Forest, SVM, and KNN with an accuracy rate of 98.63%. This testing process is carried out to identify stroke with data mining algorithms
CITATION STYLE
Azhar, Y., Firdausy, A. K., & Amelia, P. J. (2022). Perbandingan Algoritma Klasifikasi Data Mining Untuk Prediksi Penyakit Stroke. SINTECH (Science and Information Technology) Journal, 5(2), 191–197. https://doi.org/10.31598/sintechjournal.v5i2.1222
Mendeley helps you to discover research relevant for your work.