Statistical Machine Translation (SMT) is often used as a black-box in CLIR tasks. We propose an adaptation method for an SMT model relying on the monolingual statistics that can be extracted from the document collection (both source and target if available). We evaluate our approach on CLEF Domain Specific task (German-English and English-German) and show that very simple document collection statistics integrated in SMT translation model allow to obtain good gains both in terms of IR metrics (MAP, P10) and MT evaluation metrics (BLEU, TER). © 2013 Springer-Verlag.
CITATION STYLE
Nikoulina, V., & Clinchant, S. (2013). Domain adaptation of statistical machine translation models with monolingual data for cross lingual information retrieval. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7814 LNCS, pp. 768–771). https://doi.org/10.1007/978-3-642-36973-5_80
Mendeley helps you to discover research relevant for your work.