Data mining (DM) applications are composed of computing-intensive processing tasks working on huge datasets. Due to its computing-intensive nature, these applications are natural candidates for execution on high performance, high throughput platforms such as PC clusters and computational grids. Many data mining algorithms can be implemented as bag-of-tasks (BoT) applications, i.e., parallel applications composed of independent tasks. This paper discusses the use of computing grids for the execution of DM algorithms as BoT applications, investigates the scalability of the execution of an application and proposes an approach to improve its scalability. © Springer-Verlag 2004.
CITATION STYLE
Da Silva, F. A. B., Carvalho, S., Senger, H., Hruschka, E. R., & De Farias, C. R. G. (2004). Running data mining applications on the grid: A bag-of-tasks approach. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3044, 168–177. https://doi.org/10.1007/978-3-540-24709-8_18
Mendeley helps you to discover research relevant for your work.