Data abstractions for numerical attributes in data mining

Masaaki Narita; Makoto Haraguchi; Yoshiaki Okubo

Conference Proceedings

Data abstractions for numerical attributes in data mining

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2412 35-42

DOI: 10.1007/3-540-45675-9_7

1Citations

4Readers

Get full text

Abstract

In this paper, we investigate data abstractions for mining association rules with numerical conditions and boolean consequents as a target class. The act of our abstraction corresponds to joining some consecutive primitive intervals of a numerical attribute. If the interclass variance for two adjacent intervals is less than a given admissible upper-bound ∈, then they are combined together into an extended interval. Intuitively speaking, a low value of the variance means that the two intervals can provide almost the same posterior class distributions. This implies few properties or characteristics about the class would be lost by combining such intervals together. We discuss a bottom-up-method for finding maximally extended intervals, called maximal appropriate abstraction. Based on such an abstraction, we can reduce the number of extracted rules, still preserving almost the same quality of the rules extracted without abstractions. The usefulness of our abstraction method is shown by preliminary experimental results.

Cite

CITATION STYLE

APA

Narita, M., Haraguchi, M., & Okubo, Y. (2002). Data abstractions for numerical attributes in data mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2412, pp. 35–42). Springer Verlag. https://doi.org/10.1007/3-540-45675-9_7

Data abstractions for numerical attributes in data mining

Abstract

Cite

Register to see more suggestions