Automatic amharic part of speech tagging (AAPOST): A comparative approach using bidirectional lstm and conditional random fields (CRF) methods

Worku Kelemework Birhanie; Miriam Butt

Conference Proceedings

Automatic amharic part of speech tagging (AAPOST): A comparative approach using bidirectional lstm and conditional random fields (CRF) methods

Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (2020) 308 LNICST 512-521

DOI: 10.1007/978-3-030-43690-2_37

3Citations

7Readers

Get full text

Abstract

Part of speech (POS) tagging is an initial task for many natural language applications. POS tagging for Amharic is in its infancy. This study contributes towards the improvement of Amharic POS tagging by experimenting using Deep Learning and Conditional Random Fields (CRF) approaches. Word embedding is integrated into the system to enhance performance. The model was applied to an Amharic news corpus tagged into 11 major part of speeches and achieved accuracies of 91.12% and 90% for the Bidirectional LSTM and CRF methods respectively. The result shows that the Bidirectional LSTM approach performance is better than the CRF method. More enhancement is expected in the future by increasing the size and diversity of Amharic corpus.

Author supplied keywords

Cite

CITATION STYLE

APA

Birhanie, W. K., & Butt, M. (2020). Automatic amharic part of speech tagging (AAPOST): A comparative approach using bidirectional lstm and conditional random fields (CRF) methods. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (Vol. 308 LNICST, pp. 512–521). Springer. https://doi.org/10.1007/978-3-030-43690-2_37

Automatic amharic part of speech tagging (AAPOST): A comparative approach using bidirectional lstm and conditional random fields (CRF) methods

Abstract

Author supplied keywords

Cite

Register to see more suggestions