Fast, parallel implementation of particle filtering on the GPU architecture

Anna Gelencsér-Horváth; Gábor János Tornai; András Horváth; György Cserey

Journal ArticleOPEN ACCESS

Fast, parallel implementation of particle filtering on the GPU architecture

Eurasip Journal on Advances in Signal Processing (2013) 2013(1)

DOI: 10.1186/1687-6180-2013-148

5Citations

21Readers

Abstract

In this paper, we introduce a modified cellular particle filter (CPF) which we mapped on a graphics processing unit (GPU) architecture. We developed this filter adaptation using a state-of-the art CPF technique. Mapping this filter realization on a highly parallel architecture entailed a shift in the logical representation of the particles. In this process, the original two-dimensional organization is reordered as a one-dimensional ring topology. We proposed a proof-of-concept measurement on two models with an NVIDIA Fermi architecture GPU. This design achieved a 411- μs kernel time per state and a 77-ms global running time for all states for 16,384 particles with a 256 neighbourhood size on a sequence of 24 states for a bearing-only tracking model. For a commonly used benchmark model at the same configuration, we achieved a 266- μs kernel time per state and a 124-ms global running time for all 100 states. Kernel time includes random number generation on the GPU with curand. These results attest to the effective and fast use of the particle filter in high-dimensional, real-time applications. © 2013 Gelencsér-Horváth et al.; licensee Springer.

Cite

CITATION STYLE

APA

Gelencsér-Horváth, A., Tornai, G. J., Horváth, A., & Cserey, G. (2013). Fast, parallel implementation of particle filtering on the GPU architecture. Eurasip Journal on Advances in Signal Processing, 2013(1). https://doi.org/10.1186/1687-6180-2013-148

Fast, parallel implementation of particle filtering on the GPU architecture

Abstract

Cite

Register to see more suggestions