A Thesaurus for Natural Language Processing of Japanese

  • KOKUBU Y
  • OKANO H
N/ACitations
Citations of this article
3Readers
Mendeley users who have this article in their library.

Abstract

構文解析や用語標準化などの自然言語処理を目的とする440,000語規模のシソーラスを開発した. このシソーラスでは,各用語の持つ関係語の数が膨大なため,観点(ファセット)を導入して分類し, 採択した用語を探しやすくしてある.また,差別語,表記の揺れなども区別できるようにするなど工夫 をこらしている. 本稿ではこのシソーラスについて,パッケージソフトのカスタマイズ機能およびインターネットや他 の辞書との連動機能,用語の標準化などについても紹介した.シソーラスを作成する際の留意点と 課題をも指摘する. We developed a thesaurus of 440,000 terms for the purpose of natural language processing such as parsing or the term standardization. Because each entry term has a large number of terms with various semantic relations, we introduce a facet and classify them for finding relative terms easily. Furthermore, we distinguish discriminatory terms, and fluctuating Japanese spellings. Our package has the connecting function with the Internet and the other dictionaries. We described points to keep in mind and future tasks in making a thesaurus.

Cite

CITATION STYLE

APA

KOKUBU, Y., & OKANO, H. (2010). A Thesaurus for Natural Language Processing of Japanese. Joho Chishiki Gakkaishi, 19(4), 316–327. https://doi.org/10.2964/jsik.19_316

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free