Improved CUDA PSO based on global topology

Joanna Kołodziejczyk; Dariusz Sychel; Aneta Bera

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10245 LNAI 347-358

DOI: 10.1007/978-3-319-59063-9_31

Abstract

We introduce a well-optimized implementation of the particle swarm optimization (PSO) algorithm based on the Compute Unified Device Architecture (CUDA), using a global neighborhood topology with extremely large swarms (more than 1000 particles). The optimization relies on effective data organization in GPU memory, including transfer and thread optimization, pinned memory, and use of the zero-copy mechanism. Experimental results show that the GPU implementation is significantly faster than the CPU implementation.
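
For readers unfamiliar with the global neighborhood topology mentioned in the abstract, the following is a minimal CPU reference sketch of global-best PSO, in which every particle is attracted to a single swarm-wide best position. It is not the authors' CUDA implementation and shows none of the GPU memory optimizations. The function names (`sphere`, `gbest_pso`), the benchmark objective, and all parameter values are assumptions chosen for illustration.

```python
import random


def sphere(x):
    """Illustrative objective (an assumption): sum of squares, minimum 0 at the origin."""
    return sum(v * v for v in x)


def gbest_pso(objective, dim=2, swarm_size=30, iters=100,
              w=0.729, c1=1.49445, c2=1.49445, bound=5.0, seed=0):
    """Global-best PSO: all particles share one best-known position (gbest).

    The coefficients w, c1, c2 are commonly used values, not taken from the paper.
    """
    rng = random.Random(seed)
    pos = [[rng.uniform(-bound, bound) for _ in range(dim)]
           for _ in range(swarm_size)]
    vel = [[0.0] * dim for _ in range(swarm_size)]

    # Each particle remembers its own best position and value.
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]

    # Global topology: the neighborhood of every particle is the whole swarm.
    g = min(range(swarm_size), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(iters):
        for i in range(swarm_size):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Keep the particle inside the search box.
                pos[i][d] = max(-bound, min(bound, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                # gbest is updated immediately here (asynchronous variant);
                # a parallel version would typically reduce over the swarm
                # once per iteration instead.
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val
```

Calling `gbest_pso(sphere)` returns the best position found and its objective value. In a parallel setting, the inner per-particle loop is the natural unit of work for one thread, and the shared `gbest` is the only swarm-wide dependency.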

Cite

APA: Kołodziejczyk, J., Sychel, D., & Bera, A. (2017). Improved CUDA PSO based on global topology. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10245 LNAI, pp. 347–358). Springer Verlag. https://doi.org/10.1007/978-3-319-59063-9_31
