Word Embeddings Versus LDA for Topic Assignment in Documents

4Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Topic assignment for a corpus of documents is a task of natural language processing (NLP). One of the noted and well studied methods is Latent Dirichlet Allocation (LDA) where statistical methods are applied. On the other hand applying deep-learning paradigm proved useful for many NLP tasks such as classification [3], sentiment analysis [8], text summarization [11]. This paper compares the results of LDA method and application of representations provided by Word2Vec [5] which makes use of deep learning paradigm.

Cite

CITATION STYLE

APA

Jȩdrzejowicz, J., & Zakrzewska, M. (2017). Word Embeddings Versus LDA for Topic Assignment in Documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10449 LNAI, pp. 357–366). Springer Verlag. https://doi.org/10.1007/978-3-319-67077-5_34

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free