Data scaling performance on various machine learning algorithms to identify abalone sex

Willdan Aprizal Arifin; Ishak Ariawan; Ayang Armelita Rosalia; Lukman Lukman; Nabila Tufailah

Journal Article (Open Access)

Jurnal Teknologi dan Sistem Komputer (2022) 10(1) 26-31

DOI: 10.14710/jtsiskom.2021.14105

Abstract

This study analyzes the performance of machine learning algorithms with and without data scaling to assess the effectiveness of scaling. It applies min-max scaling (normalization) and zero-mean scaling (standardization) to the physical measurement features of the abalone dataset. The data were then used to train Random Forest, Naïve Bayes, Decision Tree, and SVM (RBF and linear kernels) classifiers, and each model was evaluated using 10-fold cross-validation. Results on the eight features of the abalone dataset show that data scaling had little influence on the algorithms overall. SVM performance increased when data scaling was applied, while Random Forest performance decreased. Random Forest achieved the highest average balanced accuracy (74.87%) without data scaling.
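The two scaling techniques and the evaluation metric named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the use of the population standard deviation, and the sample values and labels are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def min_max_scale(values):
    """Min-max normalization: x' = (x - min) / (max - min), giving values in [0, 1]."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        # A constant feature carries no spread; map it to zeros (assumed convention).
        return [0.0 for _ in values]
    return [(v - lo) / span for v in values]


def standardize(values):
    """Zero-mean standardization: x' = (x - mean) / std.

    Uses the population standard deviation (an assumption; the abstract
    does not say which variant was used).
    """
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def balanced_accuracy(y_true, y_pred):
    """Balanced accuracy: the unweighted mean of per-class recall."""
    recalls = []
    for cls in sorted(set(y_true)):
        idx = [i for i, t in enumerate(y_true) if t == cls]
        hits = sum(1 for i in idx if y_pred[i] == cls)
        recalls.append(hits / len(idx))
    return sum(recalls) / len(recalls)


# Hypothetical values for one physical measurement feature.
lengths = [0.2, 0.4, 0.6, 0.8]
normalized = min_max_scale(lengths)
standardized = standardize(lengths)

# Hypothetical true and predicted sex labels.
y_true = ["M", "M", "F", "I"]
y_pred = ["M", "F", "F", "I"]
score = balanced_accuracy(y_true, y_pred)
```

In a study like this one, each scaler would typically be fitted on the training folds only and then applied to the held-out fold, so that the cross-validation estimate is not influenced by test data. Whether the authors did so is not stated in the abstract.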

Citation (APA)

Arifin, W. A., Ariawan, I., Rosalia, A. A., Lukman, L., & Tufailah, N. (2022). Data scaling performance on various machine learning algorithms to identify abalone sex. Jurnal Teknologi Dan Sistem Komputer, 10(1), 26–31. https://doi.org/10.14710/jtsiskom.2021.14105