A method to measure the reading difficulty of Japanese words

Keiji Yasuda; Andrew Finch; Eiichiro Sumita

Conference Proceedings

A method to measure the reading difficulty of Japanese words

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6609 LNCS(PART 2) 436-445

DOI: 10.1007/978-3-642-19437-5_36

0Citations

4Readers

Get full text

Abstract

In this paper, we propose an automatic method to measure the reading difficulty of Japanese words. The proposed method uses a statistical transliteration framework, which was inspired by statistical machine translation research. A Dirichlet process model is used for the alignment between single kanji characters and one or more hiragana characters. The joint probability of kanji and hiragana is used to measure the difficulty. In our experiment, we carried out a linear discriminate analysis using three kinds of lexicons: a Japanese place name lexicon, a Japanese last name lexicon and a general noun lexicon. We compared the discrimination ratio given by the proposed method and the conventional method, which estimates a word difficulty based on manually defined kanji difficulty. According to the experimental results, the proposed method performs well for scoring Japanese proper noun reading difficulty. The proposed method produces a higher discrimination ratio with the proper noun lexicons (14 points higher on the place name lexicon and 26.5 points higher on the last name lexicon) than the conventional method. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Yasuda, K., Finch, A., & Sumita, E. (2011). A method to measure the reading difficulty of Japanese words. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6609 LNCS, pp. 436–445). https://doi.org/10.1007/978-3-642-19437-5_36

A method to measure the reading difficulty of Japanese words

Abstract

Cite

Register to see more suggestions