Abstract
This paper presents a method for detecting words related to a topic (we call them topic words) over time in the stream of documents. Topic words are widely distributed in the stream of documents, and sometimes they frequently appear in the documents, and sometimes not. We propose a method to reinforce topic words with low frequencies by collecting documents from the corpus, and applied Latent Dirichlet Allocation (Blei et al., 2003) to these documents. For the results of LDA, we identified topic words by using Moving Average Convergence Divergence. In order to evaluate the method, we applied the results of topic detection to extractive multi-document summarization. The results showed that the method was effective for sentence selection in summarization. © 2014 Association for Computational Linguistics.
Cite
CITATION STYLE
Suzuki, Y., & Fukumoto, F. (2014). Detection of topic and its extrinsic evaluation through multi-document summarization. In 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference (Vol. 2, pp. 241–246). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p14-2040
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.