Detecting a compact decision tree based on an appropriate abstraction

Yoshimitsu Kudoh; Makoto Haraguchi

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2000), Vol. 1983, pp. 60–70

DOI: 10.1007/3-540-44491-2_10

Abstract

It is generally accepted that pre-processing is needed to exclude irrelevant and meaningless aspects of data before data mining algorithms are applied. From this viewpoint, we have already proposed the notion of Information Theoretical Abstraction and implemented a system, ITA. Given a relational database and a family of possible abstractions for its attribute values, called an abstraction hierarchy, ITA selects the best abstraction among the possible ones so that the class distributions needed to perform the classification task are preserved as far as possible. In our previous experiment, a single application of abstraction to the whole database proved effective in reducing the size of the detected rules without worsening the classification error. However, because C4.5 performs attribute selection repeatedly and serially, ITA does not in general guarantee that class distributions are preserved over a sequence of attribute selections. For this reason, in this paper we propose a new version of ITA, called iterative ITA, which tries to preserve the class distributions as far as possible at each attribute-selection step.
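
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' ITA implementation: it assumes that "preserving class distributions" can be approximated by minimizing the increase in conditional class entropy caused by replacing concrete attribute values with abstract ones. The names `select_abstraction`, `candidates`, `by_kind`, and `by_size`, and the toy data, are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * log2(c / total) for c in counts.values())


def conditional_entropy(values, labels):
    """Entropy of the class labels given the attribute values, H(C | A)."""
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    total = len(labels)
    return sum(len(g) / total * entropy(g) for g in groups.values())


def select_abstraction(values, labels, candidates):
    """Return the name of the candidate abstraction with the least information loss.

    candidates maps a name to a dict from concrete values to abstract values
    (an assumed encoding of one level of an abstraction hierarchy).
    """
    base = conditional_entropy(values, labels)

    def loss(mapping):
        abstracted = [mapping.get(v, v) for v in values]
        return conditional_entropy(abstracted, labels) - base

    return min(candidates, key=lambda name: loss(candidates[name]))


# Toy data: one attribute column and the class column.
values = ["cat", "dog", "trout", "salmon"]
labels = ["mammal", "mammal", "fish", "fish"]
candidates = {
    "by_kind": {"cat": "pet", "dog": "pet", "trout": "fish", "salmon": "fish"},
    "by_size": {"cat": "small", "trout": "small", "dog": "large", "salmon": "large"},
}
best = select_abstraction(values, labels, candidates)
```

In this toy case `by_kind` keeps each abstract value pure with respect to the class, while `by_size` mixes the classes within each group, so `by_kind` is selected. An iterative variant in the spirit of the abstract would presumably repeat this selection on the subset of records reaching each node of the decision tree rather than once on the whole database.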

Cite

CITATION STYLE

APA

Kudoh, Y., & Haraguchi, M. (2000). Detecting a compact decision tree based on an appropriate abstraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1983, pp. 60–70). Springer Verlag. https://doi.org/10.1007/3-540-44491-2_10
