We discuss two interpretations of missing attribute values, lost values and “do not care” conditions. Both interpretations may be used for data mining based on characteristic sets. On the other hand, maximal consistent blocks were originally defined for incomplete data sets with “do not care” conditions, using only lower and upper approximations. We extended definitions of maximal consistent blocks to both interpretations while using probabilistic approximations, a generalization of lower and upper approximations. Our main objective is to compare approximations based on characteristic sets with approximations based on maximal consistent blocks in terms of an error rate.
CITATION STYLE
Clark, P. G., Gao, C., Grzymala-Busse, J. W., & Mroczek, T. (2018). A comparison of characteristic sets and generalized maximal consistent blocks in mining incomplete data. In Communications in Computer and Information Science (Vol. 854, pp. 480–489). Springer Verlag. https://doi.org/10.1007/978-3-319-91476-3_40
Mendeley helps you to discover research relevant for your work.