Complexity of Rule Sets Induced from Incomplete Data Sets Using Global Probabilistic Approximations

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We consider incomplete data sets using two interpretations of missing attribute values: lost values and "do not care" conditions. Additionally, in our data mining experiments we use global probabilistic approximations (singleton, subset and concept). The results of validation of such data, using global probabilistic approximations, were published recently. A novelty of this paper is research on the complexity of corresponding rule sets, in terms of the number of rules and number of rule conditions. Our main result is that the simplest rule sets are induced from data sets in which missing attribute values are interpreted as "do not care" conditions where rule sets are induced using subset probabilistic approximations. © Springer International Publishing Switzerland 2014.

Cite

CITATION STYLE

APA

Clark, P. G., & Grzymala-Busse, J. W. (2014). Complexity of Rule Sets Induced from Incomplete Data Sets Using Global Probabilistic Approximations. In Communications in Computer and Information Science (Vol. 442 CCIS, pp. 386–395). Springer Verlag. https://doi.org/10.1007/978-3-319-08795-5_40

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free