A language modeling approach for extracting translation knowledge from comparable corpora

Razieh Rahimi; Azadeh Shakery

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7814 LNCS, pp. 606–617 (2013)

DOI: 10.1007/978-3-642-36973-5_51

Abstract

A main challenge in cross-language information retrieval (CLIR) is estimating a translation language model, since its quality directly affects retrieval performance. The translation language model is built from translation resources such as bilingual dictionaries, parallel corpora, or comparable corpora. High-quality resources are often unavailable for resource-scarce languages, so for these languages efficient exploitation of commonly available resources such as comparable corpora is all the more crucial. In this paper, we focus on using only comparable corpora to extract translation information more efficiently. We propose a language modeling approach for estimating the translation language model. The proposed method is based on probability distribution estimation and can be tuned more easily than previous, heuristically adjusted work. Experimental results show a significant improvement in translation quality and CLIR performance compared to previous approaches. © 2013 Springer-Verlag.

Citation (APA)

Rahimi, R., & Shakery, A. (2013). A language modeling approach for extracting translation knowledge from comparable corpora. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7814 LNCS, pp. 606–617). https://doi.org/10.1007/978-3-642-36973-5_51
