Reduced data sets and entropy-based discretization

1Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Results of experiments on numerical data sets discretized using two methods-global versions of Equal Frequency per Interval and Equal IntervalWidth-are presented. Globalization of both methods is based on entropy. For discretized data sets left and right reducts were computed. For each discretized data set and two data sets, based, respectively, on left and right reducts, we applied ten-fold cross validation using the C4.5 decision tree generation system. Our main objective was to compare the quality of all three types of data sets in terms of an error rate. Additionally, we compared complexity of generated decision trees. We show that reduction of data sets may only increase the error rate and that the decision trees generated from reduced decision sets are not simpler than the decision trees generated from non-reduced data sets.

Cite

CITATION STYLE

APA

Grzymala-Busse, J. W., Hippe, Z. S., & Mroczek, T. (2019). Reduced data sets and entropy-based discretization. Entropy, 21(11). https://doi.org/10.3390/e21111051

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free