MapReduce (MR) has become a de facto standard for large-scale data analysis. It has also attracted the attention of the HPC community due to its simplicity, efficiency, and highly scalable parallel model. However, MR implementations present some issues that may complicate their execution in existing HPC clusters, especially concerning job submission. While MR requires no strict parameters to submit a job, in a typical HPC cluster users must specify the number of nodes and the amount of time required to complete the job execution. This paper presents the MR Job Adaptor, a component that optimizes the scheduling of MR jobs alongside HPC jobs in an HPC cluster. Experiments performed with real-world HPC and MapReduce workloads show that the MR Job Adaptor can properly transform MR jobs to be scheduled in an HPC cluster, minimizing job turnaround time and exploiting unused resources in the cluster. © 2012 Springer-Verlag.
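The abstract does not reproduce the Adaptor's actual sizing rules, but the core idea, mapping an unparameterized MR job onto the (nodes, walltime) pair an HPC batch scheduler requires, can be sketched roughly as follows. Everything here is an illustrative assumption rather than the authors' algorithm: the MRJob fields, the to_hpc_request heuristics, the gb_per_node_minute throughput constant, the waves parameter, and the PBS-style output line are all hypothetical.

```python
import math
from dataclasses import dataclass

@dataclass
class MRJob:
    """Hypothetical summary of a MapReduce job at submission time."""
    input_size_gb: float       # total input data volume
    map_slots_per_node: int    # concurrent map tasks one node can host
    block_size_mb: int = 64    # HDFS-style split size; one map task per split

def to_hpc_request(job: MRJob, max_nodes: int,
                   gb_per_node_minute: float = 2.0, waves: int = 3):
    """Translate an MR job into the (nodes, walltime) pair a typical HPC
    batch scheduler requires. All sizing constants are illustrative."""
    # One map task per input split, as in Hadoop's default input format.
    n_maps = math.ceil(job.input_size_gb * 1024 / job.block_size_mb)
    # Enough nodes to finish the maps in a few waves, capped by cluster size.
    nodes = min(max_nodes, math.ceil(n_maps / (job.map_slots_per_node * waves)))
    # Walltime from an assumed per-node throughput, padded for the reduce phase.
    minutes = math.ceil(1.5 * job.input_size_gb / (nodes * gb_per_node_minute))
    return nodes, minutes

nodes, minutes = to_hpc_request(MRJob(input_size_gb=500, map_slots_per_node=8),
                                max_nodes=64)
hours, mins = divmod(minutes, 60)
# Emit the PBS-style resource request an HPC cluster would expect.
print(f"#PBS -l nodes={nodes},walltime={hours:02d}:{mins:02d}:00")
```

A sketch like this highlights the tension the paper addresses: a generous walltime estimate avoids killed jobs but wastes reservation slots, while a tight one risks premature termination, which is why the Adaptor's goal of minimizing turnaround time while exploiting unused resources is nontrivial.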
CITATION STYLE
Neves, M. V., Ferreto, T., & De Rose, C. (2012). Scheduling MapReduce jobs in HPC clusters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7484 LNCS, pp. 179–190). https://doi.org/10.1007/978-3-642-32820-6_19