This paper explores the parallel programmability, performance, and energy efficiency of a recently available Calxeda ARM-based server as a potential platform for computationally intensive applications. A novel OpenCL-based parallel programming model for the server is built on the higher-level STandarD Compute Layer (STDCL) application programming interface and a remote procedure call (RPC) implementation. Platform performance is measured empirically using an N-body code executed in various configurations. Furthermore, an auto-tuning technique is developed and analyzed for optimizing the N-body algorithm on a specific architecture. © 2014 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Richie, D., Ross, J., Ruloff, J., Park, S., Pollock, L., & Shires, D. (2014). Investigation of parallel programmability and performance of a Calxeda ARM server using OpenCL. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8374 LNCS, pp. 865–874). Springer Verlag. https://doi.org/10.1007/978-3-642-54420-0_84