Word Embeddings Versus LDA for Topic Assignment in Documents

Joanna Jȩdrzejowicz; Magdalena Zakrzewska

Conference Proceedings

Word Embeddings Versus LDA for Topic Assignment in Documents

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10449 LNAI 357-366

DOI: 10.1007/978-3-319-67077-5_34

4Citations

9Readers

Get full text

Abstract

Topic assignment for a corpus of documents is a task of natural language processing (NLP). One of the noted and well studied methods is Latent Dirichlet Allocation (LDA) where statistical methods are applied. On the other hand applying deep-learning paradigm proved useful for many NLP tasks such as classification [3], sentiment analysis [8], text summarization [11]. This paper compares the results of LDA method and application of representations provided by Word2Vec [5] which makes use of deep learning paradigm.

Author supplied keywords

Cite

CITATION STYLE

APA

Jȩdrzejowicz, J., & Zakrzewska, M. (2017). Word Embeddings Versus LDA for Topic Assignment in Documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10449 LNAI, pp. 357–366). Springer Verlag. https://doi.org/10.1007/978-3-319-67077-5_34

Word Embeddings Versus LDA for Topic Assignment in Documents

Abstract

Author supplied keywords

Cite

Register to see more suggestions