Enhancing the performance of the DCA when forming and solving the equations of motion for multibody systems

5Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This chapter provides an initial investigation into using the Graphics Processing Unit (GPU) (or similar hardware) to execute the Divide-and-Conquer Algorithm (DCA), which forms and solves the equations-of-motion for articulated multibody systems. The computational time required to form and solve the equationsof- motion of a simple n-length pendulum using the GPU is compared with a standard serialCPUimplementation, a rudimentary parallelization on theCPUusing OpenMP, and some combinations of theCPUand theGPU.The hybrid version uses theGPUfor a select number of levels in the recursive sweeps and uses an OpenMP parallelization on a multi-core CPU for the remaining levels of recursion. The results demonstrate a significant performance increase when the GPU is used despite recursive algorithms being ill-suited to hardware designed for Single Instruction Multi-Data (SIMD). This is largely due to the tree-type structure of recursive processes, with half of the required operations being contained in the first level of recursion for a binary tree.

Cite

CITATION STYLE

APA

Laflin, J. J., Anderson, K. S., & Hans, M. (2016). Enhancing the performance of the DCA when forming and solving the equations of motion for multibody systems. In Computational Methods in Applied Sciences (Vol. 42, pp. 19–31). Springer Netherland. https://doi.org/10.1007/978-3-319-30614-8_2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free