The presence of a high-quality named-entity gazetteer within a cross-language information retrieval (CLIR) system is crucial for providing multilingual access to digital resources, particularly in the domain of digital libraries. In this paper we investigate an unsupervised approach for automatically extracting this kind of resource from Wikipedia: it leverages the DBpedia classification of English articles to induce the same classification onto encyclopedia pages written in other languages. By exploiting the structured information present in Wikipedia, we furthermore aim to enrich our standard gazetteer with translations into other languages as well as with alternative spellings of the entities. © 2011 Springer-Verlag.
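The projection idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `build_gazetteer`, the dictionary layout, the use of interlanguage links to carry a type from an English article to its counterparts, and the use of redirect titles as a source of alternative spellings are all assumptions made for illustration, and the data is a toy sample.

```python
from collections import defaultdict


def build_gazetteer(english_types, interlanguage_links, redirects=None):
    """Group English titles by entity type and attach linked titles.

    english_types: English article title -> entity type (assumed to come
        from a DBpedia-style classification).
    interlanguage_links: English title -> {language code: title}.
    redirects: English title -> list of alternative spellings (assumed
        here to come from redirect pages).
    """
    redirects = redirects or {}
    gazetteer = defaultdict(dict)
    for en_title, entity_type in english_types.items():
        # The type of the English article is carried over to every
        # article it is linked to in another language.
        gazetteer[entity_type][en_title] = {
            "translations": dict(interlanguage_links.get(en_title, {})),
            "alternatives": list(redirects.get(en_title, [])),
        }
    return dict(gazetteer)


# Toy data, invented for illustration only.
english_types = {"Florence": "Place", "Dante Alighieri": "Person"}
interlanguage_links = {
    "Florence": {"it": "Firenze", "de": "Florenz"},
    "Dante Alighieri": {"it": "Dante Alighieri"},
}
redirects = {"Dante Alighieri": ["Dante"]}

gazetteer = build_gazetteer(english_types, interlanguage_links, redirects)
print(gazetteer["Place"]["Florence"]["translations"]["it"])
```

In this sketch an entity with no links or redirects still receives an entry, with an empty translation map and an empty list of alternatives, so every classified English title appears in the result.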
CITATION STYLE
Bosca, A., & Dini, L. (2011). Automatic gazetteer generation from Wikipedia. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6699 LNCS, pp. 61–71). https://doi.org/10.1007/978-3-642-23160-5_5