Investigating ID3-induced rules from low-dimensional data cleaned by complete case analysis

Jeanette Auer; Richard Hall

Conference Proceedings

Investigating ID3-induced rules from low-dimensional data cleaned by complete case analysis

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2004) 3339 414-424

DOI: 10.1007/978-3-540-30549-1_37

1Citations

1Readers

Get full text

Abstract

While knowledge discovery in databases techniques require statistically complete data, real world data is incomplete, so it must be preprocessed using completeness approximation methods. The success of these methods is impacted by whether redundancy in large amounts of data overcomes incompleteness mechanisms. We investigate this impact by comparing rule sets induced from complete data with rule sets induced from incomplete data that is preprocessed using complete case analysis. To control the incomplete data construction, we apply the well-defined incompleteness mechanisms missing-at-random and missing-completely-at-random to complete data. Initial results indicate that a medium level of pattern redundancy fails to fully overcome incompleteness mechanisms, and that characterizing an appropriate redundancy threshold is non-trivial. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Auer, J., & Hall, R. (2004). Investigating ID3-induced rules from low-dimensional data cleaned by complete case analysis. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3339, pp. 414–424). Springer Verlag. https://doi.org/10.1007/978-3-540-30549-1_37

Investigating ID3-induced rules from low-dimensional data cleaned by complete case analysis

Abstract

Cite

Register to see more suggestions