Automatic trace analysis is an effective method for identifying complex performance phenomena in parallel applications. However, as the size of parallel systems and the number of processors used by individual applications is continuously raised, the traditional approach of analyzing a single global trace file, as done by KOJAK'S EXPERT trace analyzer, becomes increasingly constrained by the large number of events. In this article, we present a scalable version of the EXPERT analysis based on analyzing separate local trace files with a parallel tool which 'replays' the target application's communication behavior. We describe the new parallel analyzer architecture and discuss first empirical results. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Geimer, M., Wolf, F., Wylie, B. J. N., & Mohr, B. (2006). Scalable parallel trace-based performance analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4192 LNCS, pp. 303–312). Springer Verlag. https://doi.org/10.1007/11846802_43
Mendeley helps you to discover research relevant for your work.