Abstract
Multicore processors and a variety of accelerators have allowed scientific applications to scale to larger problem sizes. We present a performance, design methodology, platform, and architectural comparison of several application accelerators executing a Quantum Monte Carlo application. We compare the application's performance and programmability on a variety of platforms including CUDA with Nvidia GPUs, Brook+ with ATI graphics accelerators, OpenCL running on both multicore and graphics processors, C++ running on multicore processors, and a VHDL implementation running on a Xilinx FPGA. We show that OpenCL provides application portability between multicore processors and GPUs, but may incur a performance cost. Furthermore, we illustrate that graphics accelerators can make simulations involving large numbers of particles feasible. © 2011 IEEE.
Author supplied keywords
Cite
CITATION STYLE
Weber, R., Gothandaraman, A., Hinde, R. J., & Peterson, G. D. (2011). Comparing hardware accelerators in scientific applications: A case study. IEEE Transactions on Parallel and Distributed Systems, 23(1), 58–68. https://doi.org/10.1109/TPDS.2010.125
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.