Using n-best lists for named entity recognition from Chinese speech

Lufeng Zhai; Pascale Fung; Richard Schwartz; Marine Carpuat; Dekai Wu

Conference ProceedingsOPEN ACCESS

Using n-best lists for named entity recognition from Chinese speech

HLT-NAACL 2004 - Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Short Papers (2004) 37-40

DOI: 10.3115/1613984.1613994

31Citations

81Readers

Abstract

We present the first known result for named entity recognition (NER) in realistic large-vocabulary spoken Chinese. We establish this result by applying a maximum entropy model, currently the single best known approach for textual Chinese NER, to the recognition output of the BBN LVCSR system on Chinese Broadcast News utterances. Our results support the claim that transferring NER approaches from text to spoken language is a significantly more difficult task for Chinese than for English. We propose re-segmenting the ASR hypotheses as well as applying post-classification to improve the performance. Finally, we introduce a method of using n-best hypotheses that yields a small but nevertheless useful improvement NER accuracy. We use acoustic, phonetic, language model, NER and other scores as confidence measure. Experimental results show an average of 6.7% relative improvement in precision and 1.7% relative improvement in F-measure.

Cite

CITATION STYLE

APA

Zhai, L., Fung, P., Schwartz, R., Carpuat, M., & Wu, D. (2004). Using n-best lists for named entity recognition from Chinese speech. In HLT-NAACL 2004 - Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Short Papers (pp. 37–40). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1613984.1613994

Using n-best lists for named entity recognition from Chinese speech

Abstract

Cite

Register to see more suggestions