280 birds with one stone: Inducing multilingual taxonomies from wikipedia using character-level classification

3Citations
Citations of this article
27Readers
Mendeley users who have this article in their library.

Abstract

We propose a novel fully-automated approach towards inducing multilingual taxonomies from Wikipedia. Given an English taxonomy, our approach first leverages the interlanguage links of Wikipedia to automatically construct training datasets for the is-a relation in the target language. Character-level classifiers are trained on the constructed datasets, and used in an optimal path discovery framework to induce high-precision, high-coverage taxonomies in other languages. Through experiments, we demonstrate that our approach significantly outperforms the state-of-the-art, heuristics-heavy approaches for six languages. As a consequence of our work, we release presumably the largest and the most accurate multilingual taxonomic resource spanning over 280 languages.

Cite

CITATION STYLE

APA

Gupta, A., Lebret, R., Harkous, H., & Aberer, K. (2018). 280 birds with one stone: Inducing multilingual taxonomies from wikipedia using character-level classification. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 4824–4831). AAAI press. https://doi.org/10.1609/aaai.v32i1.11921

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free