Detecting a compact decision tree based on an appropriate abstraction

4Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

It is generally convinced that pre-processing for data mining is needed to exclude irrelevant and meaningless aspects of data before applying data mining algorithms. From this viewpoint, we have already proposed a notion of Information Theoretical Abstraction, and implemented a system ITA. Given a relational database and a family of possible abstractions for its attribute values, called anabstraction hierachy ITA selects the best abstraction among the possible ones so that class distributions needed to perform our clasification task are preserved as possibly as we can. According to our previous experiment, just one application of abstraction for the whole database has shown its effectiveness in reducing the size of detected rules, without making the classification error worse. However, as C4.5 performs serial attribute-selection repeatedly, ITA does not generally guarantee the preservingness of class distributions, given a sequence of attribute-selections. For this reason, in this paper, we propose a new version of ITA, called iterative ITA, so that it tries to keep the class distributions in each attribute selection step as possibly as we can.

Cite

CITATION STYLE

APA

Kudoh, Y., & Haraguchi, M. (2000). Detecting a compact decision tree based on an appropriate abstraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1983, pp. 60–70). Springer Verlag. https://doi.org/10.1007/3-540-44491-2_10

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free