An efficient coarse-grained parallel algorithm for global-local multiscale computations on massively parallel systems

13Citations
Citations of this article
20Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The existing global-local multiscale computational methods, using finite element discretization at both the macro-scale and micro-scale, are intensive both in terms of computational time and memory requirements and their parallelization using domain decomposition methods incur substantial communication overhead, limiting their application. We are interested in a class of explicit global-local multiscale methods whose architecture significantly reduces this communication overhead on massively parallel machines. However, a naïve task decomposition based on distributing individual macro-scale integration points to a single group of processors is not optimal and leads to communication overheads and idling of processors. To overcome this problem, we have developed a novel coarse-grained parallel algorithm in which groups of macro-scale integration points are distributed to a layer of processors. Each processor in this layer communicates locally with a group of processors that are responsible for the micro-scale computations. The overlapping groups of processors are shown to achieve optimal concurrency at significantly reduced communication overhead. Several example problems are presented to demonstrate the efficiency of the proposed algorithm. © 2009 John Wiley & Sons, Ltd.

Cite

CITATION STYLE

APA

Rahul, & de, S. (2010). An efficient coarse-grained parallel algorithm for global-local multiscale computations on massively parallel systems. International Journal for Numerical Methods in Engineering, 82(3), 379–402. https://doi.org/10.1002/nme.2776

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free