Advances in Knowledge Discovery and Data Mining

  • Ziegel E
  • Fayyad U
  • Piatetski-Shapiro G
  • et al.
N/ACitations
Citations of this article
1.6kReaders
Mendeley users who have this article in their library.
Get full text

Abstract

Topic modeling is a powerful tool to uncover hidden thematic structures of documents. Many conventional topic models represent documents as a bag-of-words, where the important linguistic structures of documents are neglected. In this paper, we propose a novel topic model that enriches text documents with collapsed typed dependency relations to effectively acquire syntactic and semantic dependencies between consecutive and nonconsecutive words of text documents. In addition, we propose to enforce coherent topic assignments for conceptually similar words by generalizing words with their synonyms. Our experimental studies show that the proposed model and strategy outperform the original LDA model and the Bigram Topic Model in terms of perplexity; and our performance is comparable to other models in terms of stability, coherence, and accuracy. © 2014 Springer International Publishing.

Cite

CITATION STYLE

APA

Ziegel, E. R., Fayyad, U. M., Piatetski-Shapiro, G., Smyth, P., & Uthurusamy, R. (1998). Advances in Knowledge Discovery and Data Mining. Technometrics, 40(1), 83. https://doi.org/10.2307/1271414

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free