List representation applied to sparse datacubes for data warehousing and data mining

Frank Wang; Frahi Marir; John Gordon; Helian Na

Journal Article

List representation applied to sparse datacubes for data warehousing and data mining

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 2690 871-875

DOI: 10.1007/978-3-540-45080-1_121

0Citations

2Readers

Get full text

Abstract

Typically 80% of the data in the logical OLAP datacube, the core engine of data warehouses, are zero. When it comes to sparse, the performance quickly degrades due to the heavy I/O overheads in sorting and merging intermediate results. In this work, we first introduce a list representation in main memory for storing and computing datasets. The sparse transaction dataset is compressed as the empty cells are removed Accordingly we propose a new algorithm for association rule mining on the platform of list representation, which just needs to scan the transaction database once to generate all the possible rules. In contrast, the well-known apriori algorithm requires repeated scans of the databases, thereby resulting in heavy I/O accesses particularly when considering large candidate datasets. In our opinion, this new algorithm using list representation economizes storage space and accesses. © Springer-Verlag 2003.

Cite

CITATION STYLE

APA

Wang, F., Marir, F., Gordon, J., & Na, H. (2004). List representation applied to sparse datacubes for data warehousing and data mining. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2690, 871–875. https://doi.org/10.1007/978-3-540-45080-1_121

List representation applied to sparse datacubes for data warehousing and data mining

Abstract

Cite

Register to see more suggestions