A hybrid approach for extracting arabic persons’ names and resolving their ambiguity from twitter

Omnia H. Zayed; Samhaa R. El-Beltagy

Conference Proceedings

A hybrid approach for extracting arabic persons’ names and resolving their ambiguity from twitter

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9103 355-368

DOI: 10.1007/978-3-319-19581-0_32

N/ACitations

14Readers

Get full text

Abstract

Tweets offer a novel way of communication that enables users all over the world to share real-time news and ideas. The massive amount of tweets, generated regularly by Arabic speakers, has resulted in a growing interest in building Arabic named entity recognition (NER) systems that deal with the informal colloquial Arabic. The unique characteristics of the Arabic language make Arabic NER a challenging task, which, the informal nature of tweets further complicates. The majority of previous works addressing Arabic NER were concerned with formal modern standard Arabic (MSA). Moreover, taggers and parsers were often utilized to solve the ambiguity problem of Arabic persons’ names. Although, previously developed approaches perform well on MSA text, they are not suited for colloquial Arabic. This paper introduces a hybrid approach to extract Arabic persons’ names from tweets in addition to a way to resolve their ambiguity using context bigram patterns. The introduced approach attempts not to use any language-dependent resources. Evaluation of the presented approach shows a 7% improvement in the F-score over the best reported result in the literature.

Cite

CITATION STYLE

APA

Zayed, O. H., & El-Beltagy, S. R. (2015). A hybrid approach for extracting arabic persons’ names and resolving their ambiguity from twitter. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9103, pp. 355–368). Springer Verlag. https://doi.org/10.1007/978-3-319-19581-0_32

A hybrid approach for extracting arabic persons’ names and resolving their ambiguity from twitter

Abstract

Cite

Register to see more suggestions