Research and application of DBSCAN algorithm based on Hadoop platform

Xiufen Fu; Yaguang Wang; Yanna Ge; Peiwen Chen; Shaohua Teng

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8351 LNCS 73-87

DOI: 10.1007/978-3-319-09265-2_9

Abstract

With the rapid development of the information age, more and more data can be obtained from the Internet, yet it is very difficult to extract useful information and knowledge from such huge amounts of data. Building on existing DBSCAN-based algorithms, a new improved incremental DBSCAN clustering algorithm is proposed. The improved algorithm is combined with the open-source cloud computing framework Hadoop and uses the MapReduce programming model, which makes distributed applications easy to write and simplifies distributed programming: a huge number of data elements is divided into chunks, the chunks are distributed across the cluster, and the algorithm runs as a MapReduce job. In this way, the improved DBSCAN clustering algorithm is integrated with the Hadoop framework. When data in the database is manipulated (added or deleted), only the changed data needs to be mined and similar clusters merged to form the final mining result. Compared with a serial algorithm on a single-node server and with mining the whole data set, the time delay of data processing is reduced. In the last part, the paper verifies the effectiveness of the approach through experiments and data analysis. © 2014 Springer International Publishing.
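
The chunk-then-merge idea in the abstract can be illustrated with a small single-process sketch. This is not the authors' algorithm and does not use Hadoop: the function names (`dbscan`, `merge_clusters`, `chunked_dbscan`), the sort-based chunking, and the merge rule (join two local clusters when any pair of their points lies within `eps`) are all assumptions made for illustration. Density is counted inside each chunk only, so points near a chunk boundary can be mislabelled as noise; a fuller implementation would need to handle boundaries, for example by overlapping the chunks.

```python
from itertools import combinations
from math import dist


def dbscan(points, eps, min_pts):
    """Plain DBSCAN over a list of point tuples; returns clusters as sets, noise is dropped."""
    def neighbors(p):
        # Brute-force range query; the point itself counts as its own neighbor.
        return [q for q in points if dist(p, q) <= eps]

    visited, clusters = set(), []
    for p in points:
        if p in visited:
            continue
        visited.add(p)
        seeds = neighbors(p)
        if len(seeds) < min_pts:
            continue  # not a core point; may still join a cluster later as a border point
        cluster, queue = {p}, list(seeds)
        while queue:
            q = queue.pop()
            if q not in visited:
                visited.add(q)
                q_neighbors = neighbors(q)
                if len(q_neighbors) >= min_pts:
                    queue.extend(q_neighbors)  # q is a core point: keep expanding
            if not any(q in c for c in clusters):
                cluster.add(q)
        clusters.append(cluster)
    return clusters


def merge_clusters(clusters, eps):
    """Assumed merge rule: union clusters that contain any pair of points within eps."""
    parent = list(range(len(clusters)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(clusters)), 2):
        if any(dist(p, q) <= eps for p in clusters[i] for q in clusters[j]):
            parent[find(i)] = find(j)

    merged = {}
    for i, c in enumerate(clusters):
        merged.setdefault(find(i), set()).update(c)
    return list(merged.values())


def chunked_dbscan(points, eps, min_pts, n_chunks):
    """Cluster each chunk independently (the "map" step), then merge (the "reduce" step)."""
    ordered = sorted(points)
    size = max(1, -(-len(ordered) // n_chunks))  # ceiling division, at least 1
    chunks = [ordered[i:i + size] for i in range(0, len(ordered), size)]
    local = [c for chunk in chunks for c in dbscan(chunk, eps, min_pts)]
    return merge_clusters(local, eps)
```

In this sketch each call to `dbscan` on a chunk stands in for one map task, and `merge_clusters` stands in for the reduce phase; an incremental variant would rerun only the chunks touched by added or deleted points before merging again.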

Cite

Fu, X., Wang, Y., Ge, Y., Chen, P., & Teng, S. (2014). Research and application of DBSCAN algorithm based on Hadoop platform. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8351 LNCS, pp. 73–87). Springer Verlag. https://doi.org/10.1007/978-3-319-09265-2_9
