Rule based parts of speech tagger for Chhattisgarhi language

ISSN: 22773878
1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.

Abstract

There is an increasing demand for machine translation systems for various regional languages of India. Chhattisgarhi being the language of the young Chhattisgarh state requires automatic languages translating system. Various types of natural language processing (NLP) tools are required for developing Chhattisgarhi to Hindi machine translation (MT) system. In this paper, we are presenting rule based parts of speech tagger for Chhattisgarhi language. Parts of Speech tagging is a procedure in which each word of sentence is assigned a tag from tag set. The Parts of Speech tagger is based on rule base which is formed by taken into consideration the grammatical structure of Chhattisgarhi language. The system is constructed over corpus size of 40,000 words with tag set consists of 30 different parts of speech tags. The corpus is taken from various Chhattisgarhi stories. The system achieves an accuracy of 78%.

Cite

CITATION STYLE

APA

Pandey, V., Padmavati, M. V., & Kumar, R. (2018). Rule based parts of speech tagger for Chhattisgarhi language. International Journal of Recent Technology and Engineering, 7(4), 192–194.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free