Construction and Analysis of Japanese-English Broadcast News Corpus with Named Entity Tags

2Citations
Citations of this article
70Readers
Mendeley users who have this article in their library.

Abstract

We are aiming to acquire named entity (NE) translation knowledge from nonparallel, content-aligned corpora, by utilizing NE extraction techniques. For this research, we are constructing a Japanese- English broadcast news corpus with NE tags. The tags represent not only NE class information but also coreference information within the same monolingual document and between corresponding Japanese-English document pairs. Analysis of about 1,100 annotated article pairs has shown that if NE occurrence information, such as classes, number of occurrence and occurrence order, is given for each language, it may provide a good clue for corresponding NEs across languages.

Cite

CITATION STYLE

APA

Kumano, T., Kashioka, H., Tanaka, H., & Fukusima, T. (2003). Construction and Analysis of Japanese-English Broadcast News Corpus with Named Entity Tags. In Proceedings of the Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics (ACL). https://doi.org/10.3115/1119384.1119387

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free