The contribution of lexical resources to natural language processing of CJK languages

4Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The role of lexical resources is often understated in NLP research. The complexity of Chinese, Japanese and Korean (CJK) poses special challenges to developers of NLP tools, especially in the area of word segmentation (WS), information retrieval (IR), named entity extraction (NER), and machine translation (MT). These difficulties are exacerbated by the lack of comprehensive lexical resources, especially for proper nouns, and the lack of a standardized orthography, especially in Japanese. This paper summarizes some of the major linguistic issues in the development NLP applications that are dependent on lexical resources, and discusses the central role such resources should play in enhancing the accuracy of NLP tools. © 2006 Springer-Verlag Berlin/Heidelberg.

Cite

CITATION STYLE

APA

Halpern, J. (2006). The contribution of lexical resources to natural language processing of CJK languages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4274 LNAI, pp. 768–780). https://doi.org/10.1007/11939993_77

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free