PsmArena: Partitioned shared memory for NUMA-awareness in multithreaded scientific applications

8Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

The Distributed Shared Memory (DSM) architecture is widely used in today's computer design to mitigate the ever-widening processing-memory gap, and it inevitably exhibits Non-Uniform Memory Access (NUMA) to shared-memory parallel applications. Failure to adapt to the NUMA effect can significantly downgrade application performance, especially on today's manycore platforms with tens to hundreds of cores. However, traditional approaches such as first-touch and memory policy fall short in false page-sharing, fragmentation, or ease of use. In this paper, we propose a partitioned shared-memory approach that allows multithreaded applications to achieve full NUMA-awareness with only minor code changes and develop an accompanying NUMA-aware heap manager which eliminates false page-sharing and minimizes fragmentation. Experiments on a 256-core cc-NUMA computing node show that the proposed approach helps applications to adapt to NUMA with only minor code changes and improves the performance of typical multithreaded scientific applications by up to 4.3 folds with the increased use of cores.

Cite

CITATION STYLE

APA

Yang, Z., Zhang, A., & Mo, Z. (2021). PsmArena: Partitioned shared memory for NUMA-awareness in multithreaded scientific applications. Tsinghua Science and Technology, 26(3), 287–295. https://doi.org/10.26599/TST.2019.9010036

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free