Towards distributed heterogenous high-performance computing with ViennaCL

Abstract

One of the major drawbacks of computing with graphics adapters is that the available memory is too small for problem sizes of practical relevance. To overcome this limitation for the ViennaCL library, we investigate a partitioning approach for one of the standard benchmark problems in High-Performance Computing (HPC), namely the dense matrix-matrix product. We apply this partitioning approach to problems that exceed the memory available on graphics adapters. Moreover, we investigate its applicability to distributed-memory systems by using the Message Passing Interface (MPI). Our approach is presented in detail and benchmark results are given. © 2012 Springer-Verlag.
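
The abstract does not spell out the partitioning scheme, so the following is only a minimal sketch of the general idea, assuming a plain block partition of the dense matrix-matrix product. It is not the authors' implementation and does not use the ViennaCL API. The names `matmul`, `tile`, `partitioned_matmul` and the `block` parameter are hypothetical and chosen for illustration. Each block product stands in for the piece of work that would fit into the limited memory of a graphics adapter.

```python
def matmul(a, b):
    """Reference dense product of two row-major list-of-lists matrices."""
    rows, inner, cols = len(a), len(b), len(b[0])
    return [[sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for i in range(rows)]


def tile(m, r0, r1, c0, c1):
    """Copy the sub-block m[r0:r1][c0:c1]; stands in for a host-to-device transfer."""
    return [row[c0:c1] for row in m[r0:r1]]


def partitioned_matmul(a, b, block):
    """Compute a * b while only ever multiplying blocks of at most block x block.

    Assumption: `block` is chosen so that three such blocks fit into device memory.
    """
    n, k, m = len(a), len(b), len(b[0])
    c = [[0] * m for _ in range(n)]
    for i0 in range(0, n, block):              # row blocks of A and C
        i1 = min(i0 + block, n)
        for j0 in range(0, m, block):          # column blocks of B and C
            j1 = min(j0 + block, m)
            for k0 in range(0, k, block):      # blocks along the shared dimension
                k1 = min(k0 + block, k)
                a_blk = tile(a, i0, i1, k0, k1)
                b_blk = tile(b, k0, k1, j0, j1)
                # In a real setting, only this small product would run on the device.
                partial = matmul(a_blk, b_blk)
                # Accumulate the partial result into the matching block of C.
                for di, row in enumerate(partial):
                    for dj, value in enumerate(row):
                        c[i0 + di][j0 + dj] += value
    return c
```

Because the block products for different `(i0, j0)` pairs are independent of each other, the same decomposition could in principle be spread over several processes, which is the direction the abstract points to with MPI.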

Cite

Weinbub, J., Rupp, K., & Selberherr, S. (2012). Towards distributed heterogenous high-performance computing with ViennaCL. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7116 LNCS, pp. 359–367). https://doi.org/10.1007/978-3-642-29843-1_41
