The C4.5 algorithm still has weaknesses in predicting or classifying data if a large number of classes are used which can lead to increased decision-making time. So an approach is needed to improve the performance of the C4.5 algorithm with the selected split attributes that use the application of the average gain value to help predictions. The C4.5 algorithm is one of the Decision Tree methods in the classification process using the information entropy concept. The C4.5 algorithm uses the split criteria from ID3, the Gain Ratio is a modification of the method. The ID3 algorithm uses Information Gain (IG) for the split attribute criteria, while the C4.5 algorithm with Gain Ratio (GR), where the root value comes from high gain. The conclusion of the tests that have been carried out using the Water Quality dataset in the C4.5 method has an accuracy rate of 91.30%, with a classification error rate of 8.70%. Successful implementation using the C4.5 method in predicting the Water Quality dataset.
CITATION STYLE
Mardiansyah, H., Zarlis, M., & Sitompul, O. S. (2021). Analysis of C4.5 Algorithm of Water Quality Dataset. In Journal of Physics: Conference Series (Vol. 1898). IOP Publishing Ltd. https://doi.org/10.1088/1742-6596/1898/1/012002
Mendeley helps you to discover research relevant for your work.