Active sample selection for named entity transliteration

14Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper introduces a new method for identifying named-entity (NE) transliterations within bilingual corpora. Current state-of-theart approaches usually require annotated data and relevant linguistic knowledge which may not be available for all languages. We show how to effectively train an accurate transliteration classifier using very little data, obtained automatically. To perform this task, we introduce a new active sampling paradigm for guiding and adapting the sample selection process. We also investigate how to improve the classifier by identifying repeated patterns in the training data. We evaluated our approach using English, Russian and Hebrew corpora. © 2008 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Goldwasser, D., & Roth, D. (2008). Active sample selection for named entity transliteration. In ACL-08: HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 53–56). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1557690.1557705

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free