AUC4.5: AUC-Based C4.5 Decision Tree Algorithm for Imbalanced Data Classification

47Citations
Citations of this article
92Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This paper presents a modification of Quinlan's C4.5 algorithm for imbalanced data classification. While the C4.5 algorithm uses the difference in information entropy to determine the goodness of a split, the proposed method, which is named AUC4.5, examines the difference in the area under the ROC curve (AUC) of a split. It implies that our method attempts to maximize the AUC value of a trained decision tree in order to cope with class imbalance in data. An extensive experimental study was performed on 20 real datasets from the machine learning repository at the University of California at Irvine, Irvine. The proposed AUC4.5 algorithm showed better classification than both the standard and cost-sensitive C4.5 algorithms.

Cite

CITATION STYLE

APA

Lee, J. S. (2019). AUC4.5: AUC-Based C4.5 Decision Tree Algorithm for Imbalanced Data Classification. IEEE Access, 7, 106034–106042. https://doi.org/10.1109/ACCESS.2019.2931865

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free