We present a CUDA implementation for Kepler and Maxwell GPU generations of neuroimaging registration based on the NiftyReg open-source library [1]. A wide number of strategies are deployed to accelerate the code, providing insightful guidelines to exploit the massive parallelism and memory hierarchy within emerging GPUs. Our efforts are analyzed from different perspectives: Acceleration, numerical accuracy, power consumption and energy efficiency, to identify potential scenarios where performance per watt can be optimal in large-scale biomedical applications. Experimental results suggest that parallelism and arithmetic intensity represent the most rewarding ways on the road to high performance bioinformatics when power is a major concern.
CITATION STYLE
Álvarez, F. N., Cabrera, J. A., Chico, J. F., Pérez, J., & Ujaldón, M. (2016). Neuroimaging registration on GPU: Energy-aware acceleration. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9656, pp. 627–638). Springer Verlag. https://doi.org/10.1007/978-3-319-31744-1_55
Mendeley helps you to discover research relevant for your work.