Picture books have a significant influence on children's language development. However, the sentences in picture books are difficult to analyze automatically. To improve the accuracy of morphological analysis for such sentences, we propose an automatic method that transforms existing resources into training data suitable for picture books. In this paper, we first compare picture books with common corpora and analyze why their morphological analysis is difficult. Based on this analysis, we propose a method for transforming existing resources and show its effectiveness using the learning function of an existing morphological analyzer. Second, we perform further experiments using annotated data from picture books themselves. These experiments show that our proposed method achieves the same effect as around 11,000 lines, that is, 90,000 morphological annotations, of picture-book text. In addition, we demonstrate an effective annotation strategy by investigating the learning curves and the changes in error types. In the discussion, we analyze the results with a focus on the target ages of picture books and on difficult-to-learn words, and then further refine our proposed method. Finally, we briefly consider the applicability of our method to other domains.
Fujita, S., Taira, H., Kobayashi, T., & Tanaka, T. (2014). Japanese Morphological Analysis of Picture Books. Journal of Natural Language Processing, 21(3), 515–539. https://doi.org/10.5715/jnlp.21.515