Data scaling performance on various machine learning algorithms to identify abalone sex

  • Arifin W
  • Ariawan I
  • Rosalia A
  • et al.
N/ACitations
Citations of this article
62Readers
Mendeley users who have this article in their library.

Abstract

This study aims to analyze the performance of machine learning algorithms with the data scaling process to show the method's effectiveness. It uses min-max (normalization) and zero-mean (standardization) data scaling techniques in the abalone dataset. The stages carried out in this study included data normalization on the data of abalone physical measurement features. The model evaluation was carried out using k-fold cross-validation with the number of k-fold 10. Abalone datasets were normalized in machine learning algorithms: Random Forest, Naïve Bayesian, Decision Tree, and SVM (RBF kernels and linear kernels). The eight features of the abalone dataset show that machine learning algorithms did not too influence data scaling. There is an increase in the performance of SVM, while Random Forest decreases when the abalone dataset is applied to data scaling. Random Forest has the highest average balanced accuracy (74.87%) without data scaling.

Cite

CITATION STYLE

APA

Arifin, W. A., Ariawan, I., Rosalia, A. A., Lukman, L., & Tufailah, N. (2022). Data scaling performance on various machine learning algorithms to identify abalone sex. Jurnal Teknologi Dan Sistem Komputer, 10(1), 26–31. https://doi.org/10.14710/jtsiskom.2021.14105

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free