Big Data Analytics with Apache Hadoop MapReduce Framework

  • Greeshma L
  • Pradeepini G

Abstract

Conventional database management systems cannot handle very large amounts of data. Big data technologies make it possible to store, process, and access such massive volumes. In this paper we discuss the Hadoop Distributed File System and the MapReduce architecture for storing and retrieving information from massive datasets. We propose a WordCount application built on the MapReduce object-oriented programming paradigm. It divides the input file into splits, or tokens, with the help of the java.util.StringTokenizer class. The output file is represented as <key, value> pairs. The experiments were conducted on the Hadoop framework by loading a large number of input files and evaluating the framework's performance with respect to the MapReduce object-oriented programming paradigm. We examined the performance of the map task and the reduce task as more files were loaded, along with the read-write operations performed by these jobs.
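The WordCount flow described in the abstract can be illustrated with a minimal sketch in plain Java. This is not the authors' implementation and it does not use the Hadoop API: the class name WordCountSketch, the methods map and reduce, and the sample input lines are all assumptions made for illustration. It only shows the idea of tokenizing input with java.util.StringTokenizer, emitting <word, 1> pairs, and summing the values per key.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;
import java.util.TreeMap;

// Hypothetical single-process illustration of WordCount; on a real cluster
// Hadoop would run the map and reduce steps as distributed tasks.
class WordCountSketch {

    // Map step: split one line into whitespace-delimited tokens and
    // emit a <word, 1> pair for every token.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        StringTokenizer tokenizer = new StringTokenizer(line);
        while (tokenizer.hasMoreTokens()) {
            pairs.add(new SimpleEntry<>(tokenizer.nextToken(), 1));
        }
        return pairs;
    }

    // Reduce step: group the emitted pairs by key and sum their values.
    // A TreeMap keeps the keys sorted, so the output order is stable.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            counts.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Assumed sample input standing in for the lines of an input file.
        String[] lines = {"big data with hadoop", "hadoop map reduce", "big data"};

        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            emitted.addAll(map(line));
        }

        // Print each result in the <key, value> form used for the output file.
        reduce(emitted).forEach((word, count) ->
                System.out.println("<" + word + ", " + count + ">"));
    }
}
```

Running main on the assumed sample lines prints one <key, value> line per distinct word, for example <hadoop, 2>. In an actual Hadoop job the same two steps would typically live in Mapper and Reducer classes, with the framework handling the grouping of pairs by key between them.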

Cite

Greeshma, L., & Pradeepini, G. (2016). Big Data Analytics with Apache Hadoop MapReduce Framework. Indian Journal of Science and Technology, 9(26). https://doi.org/10.17485/ijst/2016/v9i26/93418
