Evaluating MapReduce on virtual machines: The Hadoop case

Shadi Ibrahim; Hai Jin; Lu Lu; Li Qi; Song Wu; Xuanhua Shi

Conference Proceedings

Evaluating MapReduce on virtual machines: The Hadoop case

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5931 LNCS 519-528

DOI: 10.1007/978-3-642-10665-1_47

92Citations

82Readers

Get full text

Abstract

MapReduce is emerging as an important programming model for large scale parallel application. Meanwhile, Hadoop is an open source implementation of MapReduce enjoying wide popularity for developing data intensive applications in the cloud. As, in the cloud, the computing unit is virtual machine (VM) based; it is feasible to demonstrate the applicability of MapReduce on virtualized data center. Although the potential for poor performance and heavy load no doubt exists, virtual machines can instead be used to fully utilize the system resources, ease the management of such systems, improve the reliability, and save the power. In this paper, a series of experiments are conducted to measure and analyze the performance of Hadoop on VMs. Our experiments are used as a basis for outlining several issues that will need to be considered when implementing MapReduce to fit completely in the cloud. © 2009 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Ibrahim, S., Jin, H., Lu, L., Qi, L., Wu, S., & Shi, X. (2009). Evaluating MapReduce on virtual machines: The Hadoop case. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5931 LNCS, pp. 519–528). https://doi.org/10.1007/978-3-642-10665-1_47

Evaluating MapReduce on virtual machines: The Hadoop case

Abstract

Author supplied keywords

Cite

Register to see more suggestions