Abstract
The requirements of research, analysis, processing and storing of big data are more and more important because big data is increasingly vital for development in the fields of information technology, finance, medicine, etc. Most of the big data environments are built on Hadoop or Spark. However, the constructions of these kinds of big data platform are not easy for ordinary users because of the lacks of professional knowledge and familiarity with the system. To make it easier to use the big data platform for data processing and analysis, we implemented the web user interface combining the big data platform including Hadoop and Spark. Then, we packaged the whole big data platform into the virtual machine image file along with the web user interface so that users can construct the environment and do the job more quickly and efficiently. We provide the convenient web user interface, not only reduce the difficulty of building a big data platform and save time but also provide an excellent performance of the system. And we also made the comparison of performance between the web user interface and the command line using the HiBench benchmark suit.
Author supplied keywords
Cite
CITATION STYLE
Yang, C. T., Wu, C. H., Chang, W. Y., Tsai, W. F., Chan, Y. W., Kristiani, E., & Chiang, Y. P. (2019). The implementation of a hadoop ecosystem portal with virtualization deployment. In Lecture Notes on Data Engineering and Communications Technologies (Vol. 24, pp. 116–127). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-02607-3_11
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.