A hybrid clustering technique to improve big data accessibility based on machine learning approaches

7Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Big data is called to a large or complex data from traditional ones, which is unstructured in many case. Accessing to a specific value in a huge data that is not sorted or organized can be time consuming and require a high processing. With growing of data, clustering can be a most important unsupervised approach that finds a structure for data. In this paper, we demonstrate two approaches to cluster data with high accuracy, and then we sort data by implementing merge sort algorithm finally, we use binary search to find a data value point in a specific range of data. This research presents a high value efficiency combo method in big data by using genetic and k-means. After clustering with k-means total sum of the Euclidean distances is 3.37233e+09 for 4 clusters, and after genetic algorithm this number reduce to 0.0300344 in the best fit. In the second and third stage we show that after this implementation, we can access to a particular data much faster and accurate than other older methods.

Cite

CITATION STYLE

APA

Ebadati, E. O. M., & Tabrizi, M. M. (2016). A hybrid clustering technique to improve big data accessibility based on machine learning approaches. In Advances in Intelligent Systems and Computing (Vol. 433, pp. 413–423). Springer Verlag. https://doi.org/10.1007/978-81-322-2755-7_43

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free