Performance analysis tool for HPC and big data applications on scientific clusters

3Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Big data is prevalent in HPC computing. Many HPC projects rely on complex workflows to analyze terabytes or petabytes of data. These workflows often require running over thousands of CPU cores and performing simultaneous data accesses, data movements, and computation. It is challenging to analyze the performance involving terabytes or petabytes of workflow data or measurement data of the executions, from complex workflows over a large number of nodes and multiple parallel task executions. To help identify performance bottlenecks or debug the performance issues in large-scale scientific applications and scientific clusters, we have developed a performance analysis framework, using state-ofthe- art open-source big data processing tools. Our tool can ingest system logs and application performance measurements to extract key performance features, and apply the most sophisticated statistical tools and data mining methods on the performance data. It utilizes an efficient data processing engine to allow users to interactively analyze a large amount of different types of logs and measurements. To illustrate the functionality of the big data analysis framework, we conduct case studies on the workflows from an astronomy project known as the Palomar Transient Factory (PTF) and the job logs from the genome analysis scientific cluster.

Cite

CITATION STYLE

APA

Yoo, W., Koo, M., Cao, Y., Sim, A., Nugent, P., & Wu, K. (2016). Performance analysis tool for HPC and big data applications on scientific clusters. In Conquering Big Data with High Performance Computing (pp. 137–160). Springer International Publishing. https://doi.org/10.1007/978-3-319-33742-5_7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free