Performance comparison of distributed processing of large volume of data on top of xen and Docker-Based virtual clusters

5Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Recently, with the advent of cloud computing, it becomes essential to run distributed computing tasks, such as Hadoop MapReduce tasks, on top of virtual computing nodes instead of physical computing nodes. But, distributed big data processing on top of virtual machines usually causes unbalanced use of physical resources, such as memory, disk I/O and network resources, thus resulting in severe performance problems. In this paper, we show how virtualization methods affect distributed processing of very large volume of data, by comparing Hadoop MapReduce processing performance on top of Xen-based virtual clusters versus Docker-based virtual clusters. In our experiments, we compare the performance of two different virtual clusters by changing virtualization methods, block sizes and node numbers. Our results show that, in terms of the distributed big data processing performance, Docker-based virtual cluster is usually faster than Xen-based virtual cluster, but there exist some cases where Xen is faster than Docker according to the parameters, such as block size and virtual node numbers.

Cite

CITATION STYLE

APA

Chung, H., & Nah, Y. (2017). Performance comparison of distributed processing of large volume of data on top of xen and Docker-Based virtual clusters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10177 LNCS, pp. 103–113). Springer Verlag. https://doi.org/10.1007/978-3-319-55753-3_7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free