Cross-lingual taxonomy alignment with bilingual biterm topic model

Tianxing Wu; Guilin Qi; Haofen Wang; Kang Xu; Xuan Cui

Conference ProceedingsOPEN ACCESS

Cross-lingual taxonomy alignment with bilingual biterm topic model

30th AAAI Conference on Artificial Intelligence, AAAI 2016 (2016) 287-293

DOI: 10.1609/aaai.v30i1.9979

16Citations

22Readers

Abstract

As more and more multilingual knowledge becomes available on the Web, knowledge sharing across languages has become an important task to benefit many applications. One of the most crucial kinds of knowledge on the Web is taxonomy, which is used to organize and classify the Web data. To facilitate knowledge sharing across languages, we need to deal with the problem of cross-lingual taxonomy alignment, which discovers the most relevant category in the target taxonomy of one language for each category in the source taxonomy of another language. Current approaches for aligning crosslingual taxonomies strongly rely on domain-specific information and the features based on string similarities. In this paper, we present a new approach to deal with the problem of cross-lingual taxonomy alignment without using any domain-specific information. We first identify the candidate matched categories in the target taxonomy for each category in the source taxonomy using the crosslingual string similarity. We then propose a novel bilingual topic model, called Bilingual Biterm Topic Model (BiBTM), to perform exact matching. BiBTM is trained by the textual contexts extracted from the Web. We conduct experiments on two kinds of real world datasets. The experimental results show that our approach significantly outperforms the designed state-of-the-art comparison methods.

Cite

CITATION STYLE

APA

Wu, T., Qi, G., Wang, H., Xu, K., & Cui, X. (2016). Cross-lingual taxonomy alignment with bilingual biterm topic model. In 30th AAAI Conference on Artificial Intelligence, AAAI 2016 (pp. 287–293). AAAI press. https://doi.org/10.1609/aaai.v30i1.9979

Cross-lingual taxonomy alignment with bilingual biterm topic model

Abstract

Cite

Register to see more suggestions