Applying parallel design techniques to template matching with GPUs

7Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Designing algorithms for data parallelism can create significant gains in performance on SIMD architectures. The performance of General Purpose GPUs can also benefit from careful analysis of memory usage and data flow due to their large throughput and system memory bottlenecks. In this paper we present an algorithm for template matching that is designed from the beginning for the GPU architecture and achieves greater than an order of magnitude speedup over traditional algorithms designed for the CPU and reimplemented on the GPU. This shows that it is not only desirable to adapt existing algorithms to run on GPUs, but also that future algorithms should be designed with the GPU architecture in mind. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Finis Anderson, R., Kirtzic, J. S., & Daescu, O. (2011). Applying parallel design techniques to template matching with GPUs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6449 LNCS, pp. 456–468). https://doi.org/10.1007/978-3-642-19328-6_41

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free