Microblog has the characteristic of short length, complex structure and words deformation. In this paper, a two stage clustering algorithm based on probabilistic latent semantic analysis (pLSA) and K-means clustering (K-means) is proposed. Besides, this paper also presents the definition of popularity and mechanism of sorting the topics. Experiments show that our method can effectively cluster topics and be applied to microblog hot topic detection.
CITATION STYLE
Sun, Y., Ma, H., Jia, M., & Peiqing, W. (2014). An efficient microblog hot topic detection algorithm based on two stage clustering. In IFIP Advances in Information and Communication Technology (Vol. 432, pp. 90–95). Springer New York LLC. https://doi.org/10.1007/978-3-662-44980-6_10
Mendeley helps you to discover research relevant for your work.