Folding: Detailed analysis with coarse sampling

2Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Performance analysis tools help the application users to find bottlenecks that prevent the application to run at full speed in current supercomputers. The level of detail and the accuracy of the performance tools are crucial to completely depict the nature of the bottlenecks. The details exposed do not only depend on the nature of the tools (profile-based or trace-based) but also on the mechanism on which they rely (instrumentation or sampling) to gather information. In this paper we present a mechanism called folding that combines both instrumentation and sampling for trace-based performance analysis tools. The folding mechanism takes advantage of long execution runs and low frequency sampling to finely detail the evolution of the user code with minimal overhead on the application. The reports provided by the folding mechanism are extremely useful to understand the behavior of a region of code at a very low level.We also present a practical study we have done in a in-production scenario with the folding mechanism and show that the results of the folding resembles to high frequency sampling. © Springer-Verlag Berlin Heidelberg 2012.

Cite

CITATION STYLE

APA

Servat, H., Llort, G., Giménez, J., Huck, K., & Labarta, J. (2012). Folding: Detailed analysis with coarse sampling. In Proceedings of the 5th International Workshop on Parallel Tools for High Performance Computing 2011 (pp. 105–118). https://doi.org/10.1007/978-3-642-31476-6_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free