Parallel and Grid-Based Data Mining

Antonio Congiusta; Domenico Talia; Paolo Trunfio

Book Chapter

Parallel and Grid-Based Data Mining

Congiusta A
Talia D
Trunfio P

Springer-Verlag, (2006), 1017-1041

DOI: 10.1007/0-387-25465-x_48

N/ACitations

3Readers

Get full text

Abstract

Data Mining often is a computing intesive and time requiring process. For this reason, several Data Mining systems have been implemented on parallel computing platforms to achieve high performance in the analysis of large data sets. Moreover, when large data repositories are coupled with geographical distribution of data, users and systems, more sophisticated technologies are needed to implement high-performace distributed KDD systems. Since computational Grids emerged as privileged platforms for distributed computing, a growing number of Grid-based KDD systems has been proposed. In this chapter we first discuss different ways to exploit parallelism in the main Data Mining techniques and algorithms, then we discuss Grid-based KDD systems. Finally, we introduce the Knowledge Grid, an environment which makes use of standard Grid middleware to support the development of parallel and distributed knowledge discovery applications.

Cite

CITATION STYLE

APA

Congiusta, A., Talia, D., & Trunfio, P. (2006). Parallel and Grid-Based Data Mining. In Data Mining and Knowledge Discovery Handbook (pp. 1017–1041). Springer-Verlag. https://doi.org/10.1007/0-387-25465-x_48

Parallel and Grid-Based Data Mining

Abstract

Cite

Register to see more suggestions