Deserializing JSON data in hadoop

Shih Ying Chen; Hung Ming Chen; I. Hsueh Chen; Chien Che Huang

Conference Proceedings

Deserializing JSON data in hadoop

Lecture Notes in Electrical Engineering (2015) 329 79-85

DOI: 10.1007/978-94-017-9558-6_10

0Citations

1Readers

Get full text

Abstract

MapReduce provides an efficient programming framework for processing big data in parallel in Hadoop. On the other hand, as the digitalized data becomes bigger as the advances information of technology, deserializing big JSON data into paths in advance can benefit queries on the data. Therefore, using MapReduce framework to deserialize big JSON data into JSON paths is applicable. In this paper, we propose an efficient JSON data processing mechanism based on MapReduce framework. The mechanism includes a redesign of JSONInputFormat class and the other two Map and Reduce functions.

Author supplied keywords

Cite

CITATION STYLE

APA

Chen, S. Y., Chen, H. M., Chen, I. H., & Huang, C. C. (2015). Deserializing JSON data in hadoop. In Lecture Notes in Electrical Engineering (Vol. 329, pp. 79–85). Springer Verlag. https://doi.org/10.1007/978-94-017-9558-6_10

Deserializing JSON data in hadoop

Abstract

Author supplied keywords

Cite

Register to see more suggestions