Named entity recognition using an HMM-based chunk tagger

626Citations
Citations of this article
388Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper proposes a Hidden Markov Model (HMM) and an HMM-based chunk tagger, from which a named entity (NE) recognition (NER) system is built to recognize and classify names, times and numerical quantities. Through the HMM, our system is able to apply and integrate four types of internal and external evidences: 1) simple deterministic internal feature of the words, such as capitalization and digitalization; 2) internal semantic feature of important triggers; 3) internal gazetteer feature; 4) external macro context feature. In this way, the NER problem can be resolved effectively. Evaluation of our system on MUC-6 and MUC-7 English NE tasks achieves F-measures of 96.6% and 94.1% respectively. It shows that the performance is significantly better than reported by any other machine-learning system. Moreover, the performance is even consistently better than those based on handcrafted rules.

Cite

CITATION STYLE

APA

Zhou, G. D., & Su, J. (2002). Named entity recognition using an HMM-based chunk tagger. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2002-July, pp. 473–480). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1073083.1073163

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free