Semi-supervised latent Dirichlet allocation for multi-label text classification

Youwei Lu; Shogo Okada; Katsumi Nitta

Conference Proceedings

Semi-supervised latent Dirichlet allocation for multi-label text classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7906 LNAI 351-360

DOI: 10.1007/978-3-642-38577-3_36

11Citations

22Readers

Get full text

Abstract

This paper proposes a semi-supervised latent Dirichlet allocation (ssLDA) method, which differs from the existing supervised topic models for multi-label classification in mainly two aspects. Firstly both labeled and unlabeled learning data are used in ssLDA to train a model, which is very important for reducing the cost by manually labeling, especially when obtaining a fully labeled dataset is difficult. Secondly ssLDA provides a more flexible training scheme that allows two ways of labeling assignment while existing topic model-based methods usually focus on either of them: (1) a document-level assignment of labels to a document; (2) imposing word-level correspondences between words and labels within a document. Our experiment results indicate that ssLDA gains an advantage over other methods in implementation flexibility and can outperform others in terms of multi-label classification performance. © 2013 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Lu, Y., Okada, S., & Nitta, K. (2013). Semi-supervised latent Dirichlet allocation for multi-label text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7906 LNAI, pp. 351–360). https://doi.org/10.1007/978-3-642-38577-3_36

Semi-supervised latent Dirichlet allocation for multi-label text classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions