LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition

Tomonari Masada; Atsuhiro Takasu

Conference ProceedingsOPEN ACCESS

LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10862 LNCS 395-402

DOI: 10.1007/978-3-319-93713-7_33

1Citations

1Readers

Abstract

This paper proposes a method of scoring sequences generated by recurrent neural network (RNN) for automatic Tanka composition. Our method gives sequences a score based on topic assignments provided by latent Dirichlet allocation (LDA). When many word tokens in a sequence are assigned to the same topic, we give the sequence a high score. While a scoring of sequences can also be achieved by using RNN output probabilities, the sequences having large probabilities are likely to share much the same subsequences and thus are doomed to be deprived of diversity. The experimental results, where we scored Japanese Tanka poems generated by RNN, show that the top-ranked sequences selected by our method were likely to contain a wider variety of subsequences than those selected by RNN output probabilities.

Author supplied keywords

Cite

CITATION STYLE

APA

Masada, T., & Takasu, A. (2018). LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10862 LNCS, pp. 395–402). Springer Verlag. https://doi.org/10.1007/978-3-319-93713-7_33

LDA-Based Scoring of Sequences Generated by RNN for Automatic Tanka Composition

Abstract

Author supplied keywords

Cite

Register to see more suggestions