Analysis of C4.5 Algorithm of Water Quality Dataset

5Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The C4.5 algorithm still has weaknesses in predicting or classifying data if a large number of classes are used which can lead to increased decision-making time. So an approach is needed to improve the performance of the C4.5 algorithm with the selected split attributes that use the application of the average gain value to help predictions. The C4.5 algorithm is one of the Decision Tree methods in the classification process using the information entropy concept. The C4.5 algorithm uses the split criteria from ID3, the Gain Ratio is a modification of the method. The ID3 algorithm uses Information Gain (IG) for the split attribute criteria, while the C4.5 algorithm with Gain Ratio (GR), where the root value comes from high gain. The conclusion of the tests that have been carried out using the Water Quality dataset in the C4.5 method has an accuracy rate of 91.30%, with a classification error rate of 8.70%. Successful implementation using the C4.5 method in predicting the Water Quality dataset.

Cite

CITATION STYLE

APA

Mardiansyah, H., Zarlis, M., & Sitompul, O. S. (2021). Analysis of C4.5 Algorithm of Water Quality Dataset. In Journal of Physics: Conference Series (Vol. 1898). IOP Publishing Ltd. https://doi.org/10.1088/1742-6596/1898/1/012002

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free