Big data management processing with Hadoop MapReduce and spark technology: A comparison

57Citations
Citations of this article
83Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Hadoop MapReduce is processed for analysis large volume of data through multiple nodes in parallel. However MapReduce has two function Map and Reduce, large data is stored through HDFS. Lack of facility involve in MapReduce so Spark is designed to run for real time stream data and for fast queries. Spark jobs perform work on Resilient Distributed Datasets and directed acyclic graph execution engine. In this paper, we extend Hadoop MapReduce working and Spark architecture with supporting kind of operation to perform. We also show the differences between Hadoop MapReduce and Spark through Map and Reduce phase individually.

Cite

CITATION STYLE

APA

Verma, A., Mansuri, A. H., & Jain, N. (2016). Big data management processing with Hadoop MapReduce and spark technology: A comparison. In 2016 Symposium on Colossal Data Analysis and Networking, CDAN 2016. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CDAN.2016.7570891

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free