Ecosystem Description of Hadoop Platform Based on HDFS, MapReduce and Data Warehouse Tool Hive

Hongsheng Xu; Xiangkui Chen; Ganglong Fan

Conference Proceedings

Ecosystem Description of Hadoop Platform Based on HDFS, MapReduce and Data Warehouse Tool Hive

Advances in Intelligent Systems and Computing (2020) 928 1127-1133

DOI: 10.1007/978-3-030-15235-2_149

1Citations

6Readers

Get full text

Abstract

This paper introduces the processing process of the distributed file system (HDFS, MapReduce) which is the core of the Hadoop distributed computing platform and introduces the data warehouse tool Hive and the distributed database Hbase. Spark is a big data distributed programming framework, which not only implements MapReduce operator map function and reduce function and calculation model, but also provides more abundant operators. This paper describes the ecosystem of Hadoop platform based on HDFS, MapReduce and data warehouse tool Hive.

Author supplied keywords

Cite

CITATION STYLE

APA

Xu, H., Chen, X., & Fan, G. (2020). Ecosystem Description of Hadoop Platform Based on HDFS, MapReduce and Data Warehouse Tool Hive. In Advances in Intelligent Systems and Computing (Vol. 928, pp. 1127–1133). Springer Verlag. https://doi.org/10.1007/978-3-030-15235-2_149

Ecosystem Description of Hadoop Platform Based on HDFS, MapReduce and Data Warehouse Tool Hive

Abstract

Author supplied keywords

Cite

Register to see more suggestions