Offload compiler runtime for the Intel® Xeon Phi™ coprocessor

Abstract

The Intel® Xeon Phi™ coprocessor platform enables offload of computation from a host processor to a coprocessor that is a fully functional Intel® Architecture CPU. This paper presents the C/C++ and Fortran compiler offload runtime for that coprocessor. The paper addresses why offload to a coprocessor is useful, how it is specified, and under what conditions offload is profitable. It also serves as a guide to potential third-party developers of offload runtimes, such as a gcc-based offload compiler, ports of existing commercial offloading compilers, such as CAPS®, to the Intel® Xeon Phi™ coprocessor, and third-party offload library vendors that Intel is working with, such as NAG® and MAGMA®. It describes the software architecture and design of the offload compiler runtime. It enumerates the key performance features of this heterogeneous computing stack, related to initialization, data movement, and invocation. Finally, it evaluates the performance impact of those features on a set of directed micro-benchmarks and larger workloads. © 2013 Springer-Verlag.

Cite

Newburn, C. J., Deodhar, R., Dmitriev, S., Murty, R., Narayanaswamy, R., Wiegert, J., … McGuire, R. (2013). Offload compiler runtime for the Intel® Xeon Phi™ coprocessor. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7905 LNCS, pp. 239–254). https://doi.org/10.1007/978-3-642-38750-0_18
