ANEAR: Automatic named entity aliasing resolution

Ayah Zirikly; Mona Diab

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7934 LNCS 213-224

DOI: 10.1007/978-3-642-38824-8_18

Abstract

Identifying the different aliases used by or for an entity is emerging as a significant problem in reliable Information Extraction systems, especially with the proliferation of social media and their ever-growing impact on different aspects of modern life such as politics, finance, security, etc. In this paper, we address the novel problem of Named Entity Aliasing Resolution (NEAR). We attempt to solve the NEAR problem in a language-independent setting by extracting the different aliases and variants of person named entities. We generate feature vectors for the named entities by building co-occurrence models that use different weighting schemes. The aliasing resolution process applies unsupervised machine learning techniques over the vector space models in order to produce groups of entities along with their aliases. We test our approach on two languages: Arabic and English. We study the impact of varying the level of morphological preprocessing of the words, as well as the part of speech tags surrounding the person named entities, and the named entities' distribution in the data set. We create novel evaluation data sets for both languages. NEAR yields better overall performance in Arabic than in English for comparable amounts of data, effectively using the POS tag information to improve performance. Our approach achieves an F_{β=1} score of 67.85% and 70.03% for raw English and Arabic data sets, respectively. © 2013 Springer-Verlag Berlin Heidelberg.
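The pipeline the abstract outlines (co-occurrence feature vectors, a weighting scheme, then unsupervised grouping) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration, not the authors' method: the abstract does not say which weighting schemes or clustering algorithm were used, so this sketch substitutes a smoothed TF-IDF weighting, cosine similarity, and threshold-based single-link grouping. The function names, the context window, the threshold, and the toy sentences are all hypothetical, and entities are treated as single tokens for simplicity.

```python
import math
from collections import Counter, defaultdict


def cooccurrence_vectors(sentences, entities, window=2):
    """Count the words found within `window` tokens of each entity mention.

    Assumption: each entity is a single token. Entities that never occur
    with a context word get no vector.
    """
    vectors = defaultdict(Counter)
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok not in entities:
                continue
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i and tokens[j] not in entities:
                    vectors[tok][tokens[j]] += 1
    return vectors


def tfidf(vectors):
    """Reweight raw counts with a smoothed IDF (one possible weighting scheme)."""
    n = len(vectors)
    df = Counter(word for vec in vectors.values() for word in vec)
    return {
        entity: {
            word: count * (math.log((1 + n) / (1 + df[word])) + 1.0)
            for word, count in vec.items()
        }
        for entity, vec in vectors.items()
    }


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(value * b.get(word, 0.0) for word, value in a.items())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def group_aliases(vectors, threshold=0.5):
    """Single-link grouping: merge entities whose similarity meets the threshold."""
    names = sorted(vectors)
    parent = {name: name for name in names}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if cosine(vectors[a], vectors[b]) >= threshold:
                parent[find(a)] = find(b)

    groups = defaultdict(set)
    for name in names:
        groups[find(name)].add(name)
    return list(groups.values())


# Hypothetical toy corpus: two people, each mentioned under two names.
sentences = [
    ["Obama", "signed", "the", "bill"],
    ["Barack", "signed", "the", "bill"],
    ["Messi", "scored", "a", "goal"],
    ["Leo", "scored", "a", "goal"],
]
entities = {"Obama", "Barack", "Messi", "Leo"}
groups = group_aliases(tfidf(cooccurrence_vectors(sentences, entities)))
```

In this toy corpus, names that share the same contexts end up in the same group. A realistic setting would need far more data, multi-token names, and the morphological and part-of-speech features the abstract mentions, none of which are modeled here.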

Cite

Zirikly, A., & Diab, M. (2013). ANEAR: Automatic named entity aliasing resolution. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7934 LNCS, pp. 213–224). https://doi.org/10.1007/978-3-642-38824-8_18
