KWJA: A Unified Japanese Analyzer Based on Foundation Models

Abstract

We present KWJA, a high-performance unified Japanese text analyzer based on foundation models. KWJA supports a wide range of tasks, including typo correction, word segmentation, word normalization, morphological analysis, named entity recognition, linguistic feature tagging, dependency parsing, predicate-argument structure (PAS) analysis, bridging reference resolution, coreference resolution, and discourse relation analysis, making it the most versatile among existing Japanese text analyzers. KWJA solves these tasks in a multi-task manner but still achieves competitive or better performance compared to existing analyzers specialized for each task. KWJA is publicly available under the MIT license at https://github.com/ku-nlp/kwja.

Cite

Ueda, N., Omura, K., Kodama, T., Kiyomaru, H., Murawaki, Y., Kawahara, D., & Kurohashi, S. (2023). KWJA: A unified Japanese analyzer based on foundation models. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 3, pp. 538–548). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-demo.52
