Initializing deep learning based on latent dirichlet allocation for document classification

Hyung Bae Jeon; Soo Young Lee

Conference Proceedings

Initializing deep learning based on latent dirichlet allocation for document classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9949 LNCS 634-641

DOI: 10.1007/978-3-319-46675-0_70

1Citations

2Readers

Get full text

Abstract

The gradient-descent learning of deep neural networks is subject to local minima, and good initialization may depend on the tasks. In contrast, for document classification tasks, latent Dirichlet allocation (LDA) was quite successful in extracting topic representations, but its performance was limited by its shallow architecture. In this study, LDA was adopted for efficient layer-by-layer pre-training of deep neural networks for a document classification task. Two-layer feedforward networks were added at the end of the process, and trained using a supervised learning algorithm. With 10 different random initializations, the LDA-based initialization generated a much lower mean and standard deviation for false recognition rates than other state-of-the-art initialization methods. This might demonstrate that the multi-layer expansion of probabilistic generative LDA model is capable of extracting efficient hierarchical topic representations for document classification.

Author supplied keywords

Cite

CITATION STYLE

APA

Jeon, H. B., & Lee, S. Y. (2016). Initializing deep learning based on latent dirichlet allocation for document classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9949 LNCS, pp. 634–641). Springer Verlag. https://doi.org/10.1007/978-3-319-46675-0_70

Initializing deep learning based on latent dirichlet allocation for document classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions