Applying parallel design techniques to template matching with GPUs

Robert Finis Anderson; J. Steven Kirtzic; Ovidiu Daescu

Conference Proceedings

Applying parallel design techniques to template matching with GPUs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6449 LNCS 456-468

DOI: 10.1007/978-3-642-19328-6_41

7Citations

16Readers

Get full text

Abstract

Designing algorithms for data parallelism can create significant gains in performance on SIMD architectures. The performance of General Purpose GPUs can also benefit from careful analysis of memory usage and data flow due to their large throughput and system memory bottlenecks. In this paper we present an algorithm for template matching that is designed from the beginning for the GPU architecture and achieves greater than an order of magnitude speedup over traditional algorithms designed for the CPU and reimplemented on the GPU. This shows that it is not only desirable to adapt existing algorithms to run on GPUs, but also that future algorithms should be designed with the GPU architecture in mind. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Finis Anderson, R., Kirtzic, J. S., & Daescu, O. (2011). Applying parallel design techniques to template matching with GPUs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6449 LNCS, pp. 456–468). https://doi.org/10.1007/978-3-642-19328-6_41

Applying parallel design techniques to template matching with GPUs

Abstract

Cite

Register to see more suggestions