Mining uncertain data streams using clustering feature decision trees

Wenhua Xu; Zheng Qin; Hao Hu; Nan Zhao

Conference Proceedings

Mining uncertain data streams using clustering feature decision trees

Xu W
Qin Z
Hu H
et al.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 7121 LNAI(PART 2) 195-208

DOI: 10.1007/978-3-642-25856-5_15

6Citations

5Readers

Get full text

Abstract

During the last decade, classification from data streams is based on deterministic learning algorithms which learn from precise and complete data. However, a multitude of practical applications only supply approximate measurements. Usually, the estimated errors of the measurements are available. The development of highly efficient algorithms dealing with uncertain examples has emerged as an new direction. In this paper, we build a CFDTu model from data streams having uncertain attribute values. CFDTu applies an uncertain clustering algorithm that scans the data stream only once to obtain the sufficient statistical summaries. The statistics are stored in the Clustering Feature vectors, and are used for incremental decision tree induction. The vectors also serve as classifiers at the leaves to further refine the classification and reinforce any-time property. Experiments show that CFDTu outperforms a purely deterministic method in terms of accuracy and is highly scalable on uncertain data streams. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Xu, W., Qin, Z., Hu, H., & Zhao, N. (2011). Mining uncertain data streams using clustering feature decision trees. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7121 LNAI, pp. 195–208). https://doi.org/10.1007/978-3-642-25856-5_15

Mining uncertain data streams using clustering feature decision trees

Abstract

Cite

Register to see more suggestions