MapReduce is a framework for processing huge volumes of data in parallel, on large groups of nodes. Processing enormous data requires fast coordination and allocation of resources. Emphasis is on achieving maximum performance with optimal resources. This paper portraits a technique for accomplishing better resource utilization. The main objective of the work is to incorporate virtualization in Hadoop MapReduce framework and measuring the performance enhancement. In order to realize this master node is setup on physical machine and slave nodes are setup in a common physical machine as virtual machines (VM), by cloning of Hadoop configured VM images. To further enhance the performance Hadoop virtual cluster are configured to use capacity scheduler.
CITATION STYLE
Kaur, R., Kaur, H., & Dhamija, A. (2016). Improved resource exploitation by combining hadoop map reduce framework with virtualbox. In Advances in Intelligent Systems and Computing (Vol. 433, pp. 41–49). Springer Verlag. https://doi.org/10.1007/978-81-322-2755-7_5
Mendeley helps you to discover research relevant for your work.