Optimisation of data-parallel scientific applications for modern HPC platforms is challenging in terms of efficient use of heterogeneous hardware and software. It requires partitioning the computations in proportion to the speeds of computing devices. Implementation of data partitioning algorithms based on computation performance models is not trivial. It requires accurate and efficient benchmarking of devices, which may share the same resources but execute different codes, appropriate interpolation methods to predict performance, and mathematical methods to solve the data partitioning problem. In this paper, we present a software framework that addresses these issues and automates the main steps of data partitioning. We demonstrate how it can be used to optimise data-parallel applications for modern heterogeneous HPC platforms. © 2013 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Clarke, D., Zhong, Z., Rychkov, V., & Lastovetsky, A. (2013). Fupermod: A framework for optimal data partitioning for parallel scientific applications on dedicated heterogeneous HPC platforms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7979 LNCS, pp. 182–196). Springer Verlag. https://doi.org/10.1007/978-3-642-39958-9_16
Mendeley helps you to discover research relevant for your work.