Performance analysis of machine learning techniques on big data using apache spark

Garima Mogha; Khyati Ahlawat; Amit Prakash Singh

Conference Proceedings

Communications in Computer and Information Science (2018) 799 17-26

DOI: 10.1007/978-981-10-8527-7_2

3 Citations

6 Readers

Abstract

Applying intelligence to machines is a need in today’s world, and this need has led to the evolution of machine learning. The analysis of data using machine learning algorithms is an active research area, but this analysis runs into problems when the data is big data. This paper compares several classification-based machine learning algorithms, namely Decision Tree Learning, Naïve Bayes, Random Forest, and Support Vector Machines, on big data using Apache Spark. Accuracy is evaluated to determine which classification algorithm gives faster and better results.

Cite

Mogha, G., Ahlawat, K., & Singh, A. P. (2018). Performance analysis of machine learning techniques on big data using apache spark. In Communications in Computer and Information Science (Vol. 799, pp. 17–26). Springer Verlag. https://doi.org/10.1007/978-981-10-8527-7_2
