LTSG: Latent topical skip-gram for mutually improving topic model and vector representations

Jarvan Law; Hankz Hankui Zhuo; Jun Hua He; Erhu Rong

Conference ProceedingsOPEN ACCESS

LTSG: Latent topical skip-gram for mutually improving topic model and vector representations

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11258 LNCS 375-387

DOI: 10.1007/978-3-030-03338-5_32

7Citations

33Readers

Abstract

Topic models have been widely used in discovering latent topics which are shared across documents in text mining. Vector representations, word embeddings and topic embeddings, map words and topics into a low-dimensional and dense real-value vector space, which have obtained high performance in NLP tasks. However, most of the existing models assume the results trained by one of them are perfect correct and used as prior knowledge for improving the other model. Some other models use the information trained from external large corpus to help improving smaller corpus. In this paper, we aim to build such an algorithm framework that makes topic models and vector representations mutually improve each other within the same corpus. An EM-style algorithm framework is employed to iteratively optimize both topic model and vector representations. Experimental results show that our model outperforms state-of-the-art methods on various NLP tasks.

Author supplied keywords

Cite

CITATION STYLE

APA

Law, J., Zhuo, H. H., He, J. H., & Rong, E. (2018). LTSG: Latent topical skip-gram for mutually improving topic model and vector representations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11258 LNCS, pp. 375–387). Springer Verlag. https://doi.org/10.1007/978-3-030-03338-5_32

LTSG: Latent topical skip-gram for mutually improving topic model and vector representations

Abstract

Author supplied keywords

Cite

Register to see more suggestions