Abstract
When a large-scale incident or disaster occurs, there is often a great demand for rapidly developing a system to extract detailed and new information from lowresource languages (LLs). We propose a novel approach to discover comparable documents in high-resource languages (HLs), and project Entity Discovery and Linking results from HLs documents back to LLs. We leverage a wide variety of language-independent forms from multiple data modalities, including image processing (image-to-image retrieval, visual similarity and face recognition) and sound matching. We also propose novel methods to learn entity priors from a large-scale HL corpus and knowledge base. Using Hausa and Chinese as the LLs and English as the HL, experiments show that our approach achieves 36.1% higher Hausa name tagging F-score over a costly supervised model, and 9.4% higher Chineseto- English Entity Linking accuracy over state-of-the-art.
Cite
CITATION STYLE
Lu, D., Pan, X., Pourdamghani, N., Chang, S. F., Ji, H., & Knight, K. (2016). A multi-media approach to cross-lingual entity knowledge transfer. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 1, pp. 54–65). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-1006
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.