Word embeddings for multi-label document classification

14Citations
Citations of this article
96Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we analyze and evaluate word embeddings for representation of longer texts in the multi-label document classification scenario. The embeddings are used in three convolutional neural network topologies. The experiments are realized on the Czech ČTK and English Reuters-21578 standard corpora. We compare the results of word2vec static and trainable embeddings with randomly initialized word vectors. We conclude that initialization does not play an important role for classification. However, learning of word vectors is crucial to obtain good results.

Cite

CITATION STYLE

APA

Lenc, L., & Král, P. (2017). Word embeddings for multi-label document classification. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2017-September, pp. 431–437). Incoma Ltd. https://doi.org/10.26615/978-954-452-049-6_057

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free