In this paper, we present Latent Drichlet Allocation in automatic text summarization to improve accuracy in document clustering. The experiments involving 398 data set from public blog article obtained by using python scrapy crawler and scraper. Several steps of clustering in this research are preprocessing, automatic document compression using feature method, automatic document compression using LDA, word weighting and clustering algorithm The results show that automatic document summarization with LDA reaches 72% in LDA 40%, compared to traditional k-means method which only reaches 66%.
CITATION STYLE
Hidayat, E. Y., Firdausillah, F., Hastuti, K., Dewi, I. N., & Azhari. (2015). Automatic text summarization using latent drichlet allocation (LDA) for document clustering. International Journal of Advances in Intelligent Informatics, 1(3), 132–139. https://doi.org/10.26555/ijain.v1i3.43
Mendeley helps you to discover research relevant for your work.