A Thesaurus for Natural Language Processing of Japanese

Yoshihiro KOKUBU; Hiroyuki OKANO

Journal ArticleOPEN ACCESS

A Thesaurus for Natural Language Processing of Japanese

KOKUBU Y
OKANO H

Joho Chishiki Gakkaishi (2010) 19(4) 316-327

DOI: 10.2964/jsik.19_316

N/ACitations

3Readers

Abstract

構文解析や用語標準化などの自然言語処理を目的とする440,000語規模のシソーラスを開発した．このシソーラスでは，各用語の持つ関係語の数が膨大なため，観点（ファセット）を導入して分類し，採択した用語を探しやすくしてある．また，差別語，表記の揺れなども区別できるようにするなど工夫をこらしている．本稿ではこのシソーラスについて，パッケージソフトのカスタマイズ機能およびインターネットや他の辞書との連動機能，用語の標準化などについても紹介した．シソーラスを作成する際の留意点と課題をも指摘する． We developed a thesaurus of 440,000 terms for the purpose of natural language processing such as parsing or the term standardization. Because each entry term has a large number of terms with various semantic relations, we introduce a facet and classify them for finding relative terms easily. Furthermore, we distinguish discriminatory terms, and fluctuating Japanese spellings. Our package has the connecting function with the Internet and the other dictionaries. We described points to keep in mind and future tasks in making a thesaurus.

Cite

CITATION STYLE

APA

KOKUBU, Y., & OKANO, H. (2010). A Thesaurus for Natural Language Processing of Japanese. Joho Chishiki Gakkaishi, 19(4), 316–327. https://doi.org/10.2964/jsik.19_316

A Thesaurus for Natural Language Processing of Japanese

Abstract

Cite

Register to see more suggestions