On performance analysis of a multithreaded application parallelized by different programming models using intel VTune

11Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Multi-core processors are ubiquitous. Extracting the desired performance from them requires efficient techniques for partitioning a single piece of work into multiple fine-grained units of work in order to process them simultaneously. Understanding the performance behavior of a parallel system requires a close familiarity with the underlying architecture and the hardware counters. We present a performance analysis study of a multi-core system by a state-of-the-art parallel performance analyzer tool, the Intel VTune Performance Analyzer. We chose as a test-case a classic nested-loop application that exhibits unexpected performance gains using two different programming models on the same multi-core system. Our expectations were to be able to reason about the performance results by exploring the application behavior using the parallel analyzer tool. We found that it is very difficult to explain high-level performance measurements of multi-core systems by low-level hardware diagnosis. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Marowka, A. (2011). On performance analysis of a multithreaded application parallelized by different programming models using intel VTune. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6873 LNCS, pp. 317–331). https://doi.org/10.1007/978-3-642-23178-0_28

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free