Pruning decision trees with misclassification costs

Jeffrey P. Bradford; Clayton Kunz; Ron Kohavi; Cliff Brunk; Carla E. Brodley

Conference ProceedingsOPEN ACCESS

Pruning decision trees with misclassification costs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (1998) 1398 131-136

DOI: 10.1007/bfb0026682

138Citations

81Readers

Abstract

We describe an experimental study of pruning methods for decision tree classifiers when the goal is minimizing loss rather than error. In addition to two common methods for errorminimization, CART's cost-complexity pruning and C4.5's error-based pruning, we study the extension of cost-complexity pruning to loss and one pruning variant based on the Laplace correction. We perform an empirical comparison of these methods and evaluate them with respect to loss. We found that applying the Laplace correction to estimate the probability distributions atthe leaves was beneficial to all pruning methods. Unlike in error minimization, and somewhat surprisingly, performing no pruning led to results that were on par with other methods in terms of the evaluation criteria. The main advantage of pruning was in the reduction of the decision tree size, sometimes by a factor of ten. While no method dominated others on all datasets,even for the same domain different pruning mechanisms are better for different loss matrices.

Cite

CITATION STYLE

APA

Bradford, J. P., Kunz, C., Kohavi, R., Brunk, C., & Brodley, C. E. (1998). Pruning decision trees with misclassification costs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1398, pp. 131–136). Springer Verlag. https://doi.org/10.1007/bfb0026682

Pruning decision trees with misclassification costs

Abstract

Cite

Register to see more suggestions