Rule based parts of speech tagger for Chhattisgarhi language

Vikas Pandey; M. V. Padmavati; Ramesh Kumar

Journal Article

Rule based parts of speech tagger for Chhattisgarhi language

International Journal of Recent Technology and Engineering (2018) 7(4) 192-194

ISSN: 22773878

2Citations

3Readers

Abstract

There is an increasing demand for machine translation systems for various regional languages of India. Chhattisgarhi being the language of the young Chhattisgarh state requires automatic languages translating system. Various types of natural language processing (NLP) tools are required for developing Chhattisgarhi to Hindi machine translation (MT) system. In this paper, we are presenting rule based parts of speech tagger for Chhattisgarhi language. Parts of Speech tagging is a procedure in which each word of sentence is assigned a tag from tag set. The Parts of Speech tagger is based on rule base which is formed by taken into consideration the grammatical structure of Chhattisgarhi language. The system is constructed over corpus size of 40,000 words with tag set consists of 30 different parts of speech tags. The corpus is taken from various Chhattisgarhi stories. The system achieves an accuracy of 78%.

Author supplied keywords

Cite

CITATION STYLE

APA

Pandey, V., Padmavati, M. V., & Kumar, R. (2018). Rule based parts of speech tagger for Chhattisgarhi language. International Journal of Recent Technology and Engineering, 7(4), 192–194.

Rule based parts of speech tagger for Chhattisgarhi language

Abstract

Author supplied keywords

Cite

Register to see more suggestions