HowtogetaChineseName(Entity): Segmentation and Combination Issues

24Citations
Citations of this article
77Readers
Mendeley users who have this article in their library.
Get full text

Abstract

When building a Chinese named entity recognition system, one must deal with certain language-specific issues such as whether the model should be based on characters or words. While there is no unique answer to this question, we discuss in detail advantages and disadvantages of each model, identify problems in segmentation and suggest possible solutions, presenting our observations, analysis, and experimental results. The second topic of this paper is classifier combination. We present and describe four classifiers for Chinese named entity recognition and describe various methods for combining their outputs. The results demonstrate that classifier combination is an effective technique of improving system performance: experiments over a large annotated corpus of fine-grained entity types exhibit a 10% relative reduction in F-measure error.

Cite

CITATION STYLE

APA

Jing, H., Florian, R., Luo, X., Zhang, T., & Ittycheriah, A. (2003). HowtogetaChineseName(Entity): Segmentation and Combination Issues. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, EMNLP 2003 (pp. 200–207). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1119355.1119381

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free