Integrated runtime measurement summarisation and selective event tracing for scalable parallel execution performance diagnosis

6Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Straightforward trace collection and processing becomes increasingly challenging and ultimately impractical for more complex, longrunning, highly parallel applications. Accordingly, the SCALASCA project is extending the KOJAK measurement system for MPI, OpenMP and partitioned global address space (PGAS) parallel applications to incorporate runtime management and summarisation capabilities. This offers a more scalable and effective profile of parallel execution performance for an initial overview and to direct instrumentation and event tracing to the key functions and callpaths for comprehensive analysis. The design and re-structuring of the revised measurement system are described, high-lighting the synergies possible from integrated runtime callpath summarisation and event tracing for scalable parallel execution performance diagnosis. Early results from measurements of 16,384 MPI processes on IBM BlueGene/L already demonstrate considerably improved scalability. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Wylie, B. J. N., Wolf, F., Mohr, B., & Geimer, M. (2007). Integrated runtime measurement summarisation and selective event tracing for scalable parallel execution performance diagnosis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4699 LNCS, pp. 460–469). Springer Verlag. https://doi.org/10.1007/978-3-540-75755-9_55

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free