A sampling based algorithm for finding association rules from uncertain data

Zhu Qian; Pan Donghua; Yang Guangfei

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6319 LNAI(PART 1) 124-131

DOI: 10.1007/978-3-642-16530-6_16

0 Citations

1 Reader

Abstract

Since there are many real-life situations in which people are uncertain about the content of transactions, association rule mining with uncertain data is in demand. Most existing studies focus on improving classical algorithms for frequent itemset mining. To obtain a tradeoff between accuracy and computation time, in this paper we introduce SARMUT, an efficient sampling-based algorithm for finding association rules from uncertain data, which is based on the FAST algorithm introduced by Chen et al. Unlike FAST, SARMUT is designed for uncertain data mining. In response to the special characteristics of uncertainty, we propose a new definition of "distance" as a measure to pick representative transactions. To evaluate its performance and accuracy, a comparison against the natural extension of FAST is performed using synthetic datasets. The experimental results show that the proposed sampling algorithm SARMUT outperforms the FAST algorithm and achieves up to 97% accuracy in some cases. © 2010 Springer-Verlag.
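
The abstract does not give SARMUT's distance definition, so the following is only a rough illustration of the general idea of picking representative transactions from uncertain data, not the authors' method. It assumes each transaction maps an item to an existence probability, uses the mean probability as an item's expected support, and trims an initial random sample greedily so that the remaining transactions keep single-item expected supports close to those of the initial sample. The L1 distance, the function names, and the synthetic data are all assumptions made for this sketch.

```python
import random


def expected_supports(transactions, items):
    """Expected support of each single item: its mean existence probability."""
    n = len(transactions)
    return {i: sum(t.get(i, 0.0) for t in transactions) / n for i in items}


def distance(subset, reference_supports, items):
    """Assumed measure: L1 gap between subset and reference expected supports."""
    s = expected_supports(subset, items)
    return sum(abs(s[i] - reference_supports[i]) for i in items)


def trim_sample(initial_sample, final_size):
    """Greedily drop transactions until final_size remain, each time removing
    the one whose absence keeps the subset closest to the initial sample."""
    if final_size < 1:
        raise ValueError("final_size must be at least 1")
    items = sorted({i for t in initial_sample for i in t})
    reference = expected_supports(initial_sample, items)
    subset = list(initial_sample)
    while len(subset) > final_size:
        k = min(
            range(len(subset)),
            key=lambda j: distance(subset[:j] + subset[j + 1:], reference, items),
        )
        subset.pop(k)
    return subset


# Hypothetical uncertain data: item -> existence probability per transaction.
rng = random.Random(0)
data = [
    {i: round(rng.random(), 2) for i in "abcd" if rng.random() < 0.7}
    for _ in range(200)
]
initial = rng.sample(data, 60)   # simple random initial sample
final = trim_sample(initial, 20)  # trimmed, more representative subset
```

Frequent itemsets and rules would then be mined from `final` instead of the full dataset, trading some accuracy for a smaller input.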

Cite

Qian, Z., Donghua, P., & Guangfei, Y. (2010). A sampling based algorithm for finding association rules from uncertain data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6319 LNAI, pp. 124–131). https://doi.org/10.1007/978-3-642-16530-6_16
