Effective utilization of storage space by applying file level and block-level deduplication over HDFS

ISSN: 22783075
N/ACitations
Citations of this article
4Readers
Mendeley users who have this article in their library.

Abstract

Hadoop framework is very efficient and easy to handle huge records storage as well as its processing. Hadoop makes use of massive commodity hardware clusters to save and process massive data in an allotted fashion. Open Source, Massive information handling capabilities and faster processing abilities made it very popular. Existing Hadoop Framework destroys metadata of preceding jobs, it actually allocates Data Nodes via ignoring what it has processed earlier and hence for each new process it reads data from all Data Nodes. There isn't any provision made for checking relationships between similar data blocks. Thus it weakens the Hadoop overall performance. The uploaded big data files are partitioned in to number of blocks and are distributed over node clusters. To avoid random block distribution and data-duplication, deduplication system is used. Such deduplication system focuses on space management and only keeps track of data files on Hadoop Distributed File System (HDFS). Such system do not participate in efficient job execution in map reduce environment. For efficient execution of job, data locality information and job metadata is stored. Time required for job execution can be decreased for next execution of same job by preserving job metadata. A combined environment produce efficient job execution results with efficient space management.

Author supplied keywords

Cite

CITATION STYLE

APA

Thanekar, S. A., Subrahmanyam, K., & Bagwan, A. (2019). Effective utilization of storage space by applying file level and block-level deduplication over HDFS. International Journal of Innovative Technology and Exploring Engineering, 8(6), 725–730.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free