Design of a task-parallel version of ILUPACK for graphics processors

2Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In many scientific and engineering applications, the solution of large sparse systems of equations is one of the most important stages. For this reason, many libraries have been developed among which ILUPACK stands out due to its efficient inverse-based multilevel preconditioner. Several parallel versions of ILUPACK have been proposed in the past. In particular, two task-parallel versions, for shared and distributed memory platforms, and a GPU accelerated data-parallel variant have been developed to solve symmetric positive definite linear systems. In this work we evaluate the combination of both previously covered approaches. Specifically, we leverage the computational power of one GPU (associated with the data-level parallelism) to accelerate each computation of the multicore (task-parallel) variant of ILUPACK. The performed experimental evaluation shows that our proposal can accelerate the multicore variant when the leaf tasks of the parallel solver offer an acceptable dimension.

Cite

CITATION STYLE

APA

Aliaga, J. I., Dufrechou, E., Ezzatti, P., & Quintana-Ortí, E. S. (2017). Design of a task-parallel version of ILUPACK for graphics processors. In Communications in Computer and Information Science (Vol. 697, pp. 91–103). Springer Verlag. https://doi.org/10.1007/978-3-319-57972-6_7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free