One of the automated methods for textual data analysis is topic detection. Fuzzy C-Means is a soft clustering-based method for topic detection. Textual data usually has a high dimensional data, which make Fuzzy C-Means fails for topic detection. An approach to overcome the problem is transforming the textual data into lower dimensional space to identify the memberships of the textual data in clusters and use these memberships to generate topics from the high dimensional textual data in the original space. In this paper, we apply the Fuzzy C-Means in lower dimensional space for topic detection on Indonesian online news. Our simulations show that the Fuzzy C-Means gives comparable accuracies than nonnegative matrix factorization and better accuracies than latent Dirichlet allocation regarding topic interpretation in the form of coherence values.
CITATION STYLE
Nugraha, P., Rifky Yusdiansyah, M., & Murfi, H. (2019). Fuzzy C-means in lower dimensional space for topics detection on indonesian online news. In Communications in Computer and Information Science (Vol. 1071, pp. 269–276). Springer Verlag. https://doi.org/10.1007/978-981-32-9563-6_28
Mendeley helps you to discover research relevant for your work.