Alignment of bilingual named entities in parallel corpora using statistical model

7Citations
Citations of this article
34Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Named entities make up a bulk of documents. Extracting named entities is crucial to various applications of natural language processing. Although efforts to identify named entities within monolingual documents are numerous, extracting bilingual named entities has not been investigated extensively owing to the complexity of the task. In this paper, we describe a statistical phrase translation model and a statistical transliteration model. Under the proposed models, a new method is proposed to align bilingual named entities in parallel corpora. Experimental results indicate that a satisfactory precision rate can be achieved. To enhance the performance, we also describe how to improve the proposed method by incorporating approximate matching and person name recognition. Experimental results show that performance is significantly improved with the enhancement. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Lee, C. J., Chang, J. S., & Chuang, T. C. (2004). Alignment of bilingual named entities in parallel corpora using statistical model. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3265, 144–153. https://doi.org/10.1007/978-3-540-30194-3_17

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free