Applying Intelligence to the machines is a need in today’s world and this need leads to the evolution of machine learning. The analysis of data using machine learning algorithms is a trending research area and this analysis lead to some problems when the data comes out to be big data. This paper compares various classification based machine learning algorithms namely, Decision Tree Learning, Naïve Bayes, Random Forest and Support Vector Machines on big data using Apache Spark. The accuracy is evaluated to find out which classification based algorithm gives fast and better result.
CITATION STYLE
Mogha, G., Ahlawat, K., & Singh, A. P. (2018). Performance analysis of machine learning techniques on big data using apache spark. In Communications in Computer and Information Science (Vol. 799, pp. 17–26). Springer Verlag. https://doi.org/10.1007/978-981-10-8527-7_2
Mendeley helps you to discover research relevant for your work.