Performance improvement of MapReduce framework by identifying slow TaskTrackers in heterogeneous Hadoop cluster

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

MapReduce is presently recognized as a significant parallel and distributed programming model with wide acclaim for large scale computing. MapReduce framework divides a job into map, reduce tasks and schedules these tasks in a distributed manner across the cluster. Scheduling of tasks and identification of “slow TaskTrackers” in heterogeneous Hadoop clusters is the focus of recent research. MapReduce performance is currently limited by its default scheduler, which does not adapt well in heterogeneous environments. In this paper, we propose a scheduling method to identify “slow TaskTrackers” in a heterogeneous Hadoop cluster and implement the proposed method by integrating it with the Hadoop default scheduling algorithm. The performance of this method is compared with the Hadoop default scheduler. We observe that the proposed approach shows modest but consistent improvement against the default Hadoop scheduler in heterogeneous environments. We see that it improves by minimizing the overall job execution time.

Cite

CITATION STYLE

APA

Naik, N. S., Negi, A., & Sastry, V. N. (2016). Performance improvement of MapReduce framework by identifying slow TaskTrackers in heterogeneous Hadoop cluster. In Smart Innovation, Systems and Technologies (Vol. 44, pp. 465–473). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-81-322-2529-4_49

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free