Deep recurrent neural networks with word embeddings for Urdu named entity recognition

Wahab Khan; Ali Daud; Fahd Alotaibi; Naif Aljohani; Sachi Arafat

Journal ArticleOPEN ACCESS

Deep recurrent neural networks with word embeddings for Urdu named entity recognition

ETRI Journal (2020) 42(1) 90-100

DOI: 10.4218/etrij.2018-0553

24Citations

53Readers

Abstract

Named entity recognition (NER) continues to be an important task in natural language processing because it is featured as a subtask and/or subproblem in information extraction and machine translation. In Urdu language processing, it is a very difficult task. This paper proposes various deep recurrent neural network (DRNN) learning models with word embedding. Experimental results demonstrate that they improve upon current state-of-the-art NER approaches for Urdu. The DRRN models evaluated include forward and bidirectional extensions of the long short-term memory and back propagation through time approaches. The proposed models consider both language-dependent features, such as part-of-speech tags, and language-independent features, such as the “context windows” of words. The effectiveness of the DRNN models with word embedding for NER in Urdu is demonstrated using three datasets. The results reveal that the proposed approach significantly outperforms previous conditional random field and artificial neural network approaches. The best f-measure values achieved on the three benchmark datasets using the proposed deep learning approaches are 81.1%, 79.94%, and 63.21%, respectively.

Author supplied keywords

Cite

CITATION STYLE

APA

Khan, W., Daud, A., Alotaibi, F., Aljohani, N., & Arafat, S. (2020). Deep recurrent neural networks with word embeddings for Urdu named entity recognition. ETRI Journal, 42(1), 90–100. https://doi.org/10.4218/etrij.2018-0553

Deep recurrent neural networks with word embeddings for Urdu named entity recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions