Straightforward trace collection and processing becomes increasingly challenging and ultimately impractical for more complex, longrunning, highly parallel applications. Accordingly, the SCALASCA project is extending the KOJAK measurement system for MPI, OpenMP and partitioned global address space (PGAS) parallel applications to incorporate runtime management and summarisation capabilities. This offers a more scalable and effective profile of parallel execution performance for an initial overview and to direct instrumentation and event tracing to the key functions and callpaths for comprehensive analysis. The design and re-structuring of the revised measurement system are described, high-lighting the synergies possible from integrated runtime callpath summarisation and event tracing for scalable parallel execution performance diagnosis. Early results from measurements of 16,384 MPI processes on IBM BlueGene/L already demonstrate considerably improved scalability. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Wylie, B. J. N., Wolf, F., Mohr, B., & Geimer, M. (2007). Integrated runtime measurement summarisation and selective event tracing for scalable parallel execution performance diagnosis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4699 LNCS, pp. 460–469). Springer Verlag. https://doi.org/10.1007/978-3-540-75755-9_55
Mendeley helps you to discover research relevant for your work.