Japanese Universal Dependencies Corpora

  • Asahara M
  • Kanayama H
  • Miyao Y
  • et al.

Abstract

Universal Dependencies (UD) is an international project to develop multilingual dependency treebanks under a uniform annotation scheme, with the aim of enabling cross-lingual learning from multilingual corpora and quantitative comparison of languages. As of mid-2018, more than 100 corpora for about 60 languages had been released. This paper describes the definition of annotations for Japanese. We discuss localization issues concerning PoS tags, case-marking dependency labels, and the distinction between phrases and clauses in Japanese. We also present issues with coordination structures, which cannot be represented by dependency tree structures alone. Finally, we report the current status of the UD Japanese corpora we have constructed.

Cite

Asahara, M., Kanayama, H., Miyao, Y., Tanaka, T., Omura, M., Murawaki, Y., & Matsumoto, Y. (2019). Japanese Universal Dependencies Corpora. Journal of Natural Language Processing, 26(1), 3–36. https://doi.org/10.5715/jnlp.26.3
