Toward a MapReduce-Based K-Means Method for Multi-dimensional Time Serial Data Clustering

Yongzheng Lin; Kun Ma; Runyuan Sun; Ajith Abraham

Conference Proceedings

Toward a MapReduce-Based K-Means Method for Multi-dimensional Time Serial Data Clustering

Advances in Intelligent Systems and Computing (2018) 736 816-825

DOI: 10.1007/978-3-319-76348-4_78

1Citations

2Readers

Get full text

Abstract

Time series data is a sequence of real numbers that represent the measurements of a real variable at equal time intervals. There are some bottlenecks to process large scale data. In this paper, we firstly propose a K-means method for multi-dimensional time serial data clustering. As an improvement, MapReduce framework is used to implement this method in parallel. Different versions of k-means for several distance measures are compared, and the experiments show that MapReduce-based K-means has better speedup when the scale of data is larger.

Author supplied keywords

Cite

CITATION STYLE

APA

Lin, Y., Ma, K., Sun, R., & Abraham, A. (2018). Toward a MapReduce-Based K-Means Method for Multi-dimensional Time Serial Data Clustering. In Advances in Intelligent Systems and Computing (Vol. 736, pp. 816–825). Springer Verlag. https://doi.org/10.1007/978-3-319-76348-4_78

Toward a MapReduce-Based K-Means Method for Multi-dimensional Time Serial Data Clustering

Abstract

Author supplied keywords

Cite

Register to see more suggestions