Cross-lingual taxonomy alignment with bilingual biterm topic model

Abstract

As more and more multilingual knowledge becomes available on the Web, knowledge sharing across languages has become an important task that benefits many applications. One of the most crucial kinds of knowledge on the Web is the taxonomy, which is used to organize and classify Web data. To facilitate knowledge sharing across languages, we need to deal with the problem of cross-lingual taxonomy alignment, which discovers, for each category in the source taxonomy of one language, the most relevant category in the target taxonomy of another language. Current approaches for aligning cross-lingual taxonomies rely strongly on domain-specific information and on features based on string similarity. In this paper, we present a new approach to cross-lingual taxonomy alignment that does not use any domain-specific information. We first identify the candidate matched categories in the target taxonomy for each category in the source taxonomy using cross-lingual string similarity. We then propose a novel bilingual topic model, called Bilingual Biterm Topic Model (BiBTM), to perform exact matching. BiBTM is trained on textual contexts extracted from the Web. We conduct experiments on two kinds of real-world datasets. The experimental results show that our approach significantly outperforms the state-of-the-art methods designed for comparison.
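
The first step described above, selecting candidate target categories by string similarity, might look roughly like the sketch below. It is only an illustration under stated assumptions: the abstract does not say which similarity measure or translation resource the authors use, so the toy dictionary TOY_TRANSLATIONS, the helper names, the threshold, and the use of Python's difflib ratio are all hypothetical stand-ins. The BiBTM matching step is not shown.

```python
from difflib import SequenceMatcher

# Hypothetical toy dictionary standing in for a real translation step;
# the abstract does not specify how labels are bridged across languages.
TOY_TRANSLATIONS = {
    "手机": "mobile phones",
    "笔记本电脑": "laptop computers",
}


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (difflib ratio, an assumption)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def candidate_categories(source_label, target_labels, top_k=2, threshold=0.3):
    """Return up to top_k (target_label, score) pairs scoring at least threshold."""
    # Fall back to the raw label when the toy dictionary has no entry.
    translated = TOY_TRANSLATIONS.get(source_label, source_label)
    scored = [(t, similarity(translated, t)) for t in target_labels]
    kept = [(t, s) for t, s in scored if s >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:top_k]


targets = ["Mobile Phones", "Phone Accessories", "Laptops", "Garden Tools"]
candidates = candidate_categories("手机", targets)
for label, score in candidates:
    print(f"{label}: {score:.2f}")
```

In a pipeline of this shape, the shortlist would only narrow the search space; choosing the final match among the candidates is the job the abstract assigns to BiBTM.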

Cite

Wu, T., Qi, G., Wang, H., Xu, K., & Cui, X. (2016). Cross-lingual taxonomy alignment with bilingual biterm topic model. In 30th AAAI Conference on Artificial Intelligence, AAAI 2016 (pp. 287–293). AAAI Press. https://doi.org/10.1609/aaai.v30i1.9979
