The process of designing a parallel data warehouse has two main steps: (1) fragmentation and (2) allocation of so-generated fragments at various nodes. Usually, we split the data warehouse horizontally, allocate fragments over nodes, and finally balance the load over the nodes of the parallel machine. The main drawback of such design approach is that the high communication cost. Therefore, Data Replication (DR) has become a requirement for availability on the one hand but also for minimizing the communication cost on the other hand. In this paper, we present a redundant allocation algorithm for designing shared-nothing parallel relational data warehouses, which is based on the well-known fuzzy k-means clustering algorithm.
CITATION STYLE
Benkrid, S., Bellatreche, L., & Cuzzocrea, A. (2014). Designing parallel relational data warehouses: A global, comprehensive approach. In Advances in Intelligent Systems and Computing (Vol. 241, pp. 141–150). Springer Verlag. https://doi.org/10.1007/978-3-319-01863-8_16
Mendeley helps you to discover research relevant for your work.