Domain adaptation of statistical machine translation models with monolingual data for cross lingual information retrieval

Vassilina Nikoulina; Stéphane Clinchant

Conference Proceedings

Domain adaptation of statistical machine translation models with monolingual data for cross lingual information retrieval

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7814 LNCS 768-771

DOI: 10.1007/978-3-642-36973-5_80

2Citations

8Readers

Get full text

Abstract

Statistical Machine Translation (SMT) is often used as a black-box in CLIR tasks. We propose an adaptation method for an SMT model relying on the monolingual statistics that can be extracted from the document collection (both source and target if available). We evaluate our approach on CLEF Domain Specific task (German-English and English-German) and show that very simple document collection statistics integrated in SMT translation model allow to obtain good gains both in terms of IR metrics (MAP, P10) and MT evaluation metrics (BLEU, TER). © 2013 Springer-Verlag.

Cite

CITATION STYLE

APA

Nikoulina, V., & Clinchant, S. (2013). Domain adaptation of statistical machine translation models with monolingual data for cross lingual information retrieval. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7814 LNCS, pp. 768–771). https://doi.org/10.1007/978-3-642-36973-5_80

Domain adaptation of statistical machine translation models with monolingual data for cross lingual information retrieval

Abstract

Cite

Register to see more suggestions