Benchmarking Mi-POS: Malay Part-of-Speech Tagger

  • Xian B
  • Lubani M
  • et al.
N/ACitations
Citations of this article
34Readers
Mendeley users who have this article in their library.

Abstract

A part-of-speech tagger as signs the correct grammatical category to each word in a given text based on the context surrounding the word. This paper presents Mi-POS, a Malay language Part-of-Speech tagger that is developed using a probabilistic approach with information about the context. The results of benchmarking Mi-POS against several similar systems are also presented in this paper and the lessons learnt from it are highlighted. The dataset used for evaluation consists of manually annotated texts. The authors used the accuracy and time to measure the results of this evaluation. The final results show that Mi-POS outperforms other Malay Part-of-Speech taggers in terms of accuracy with an accuracy of 95.16% obtained by tagging new words from the same training corpus type and 81.12% for words from different corpora types

Cite

CITATION STYLE

APA

Xian, B. C. M., Lubani, M., Ping, L. K., Bouzekri, K., Mahmud, R., & Lukose, D. (2016). Benchmarking Mi-POS: Malay Part-of-Speech Tagger. International Journal of Knowledge Engineering, 2(3), 115–121. https://doi.org/10.18178/ijke.2016.2.3.064

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free