Image-mediated learning for zero-shot cross-lingual document retrieval

Ruka Funaki; Hideki Nakayama

Conference ProceedingsOPEN ACCESS

Image-mediated learning for zero-shot cross-lingual document retrieval

Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (2015) 585-590

DOI: 10.18653/v1/d15-1070

31Citations

109Readers

Abstract

We propose an image-mediated learning approach for cross-lingual document retrieval where no or only a few parallel corpora are available. Using the images in image-text documents of each language as the hub, we derive a common semantic subspace bridging two languages by means of generalized canonical correlation analysis. For the purpose of evaluation, we create and release a new document dataset consisting of three types of data (English text, Japanese text, and images). Our approach substantially enhances retrieval accuracy in zero-shot and few-shot scenarios where text-to-text examples are scarce.

Cite

CITATION STYLE

APA

Funaki, R., & Nakayama, H. (2015). Image-mediated learning for zero-shot cross-lingual document retrieval. In Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (pp. 585–590). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d15-1070

Image-mediated learning for zero-shot cross-lingual document retrieval

Abstract

Cite

Register to see more suggestions