Regular expression tagger for Kannada parts of speech tagging

2Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Part of speech tagging for Indian languages in general and Kannada in particular is not a very widely explored territory. There have been many attempts at developing a good POS tagger for Kannada, but the morphological complexity of the language makes it a hard nut to crack. Some of the best taggers available for Indian languages employ hybrids of machine learning or stochastic methods and linguistic knowledge. Though the results achieved using such methods are good, their practicability for other inflective Indian languages is reduced due to their heavy dependence on linguistic knowledge. Even though taggers can achieve very good results if provided good morphological information, the cost of creating these resources renders such methods impractical. In this paper, we present regular expression parts of speech tagger for Kannada. We apply 100 patterns incorporating the TDIL tags for Kannada and tested for accuracy with manual tagged corpus.

Cite

CITATION STYLE

APA

Shiva Kumar, K. M., & Gupta, D. (2018). Regular expression tagger for Kannada parts of speech tagging. In Advances in Intelligent Systems and Computing (Vol. 712, pp. 121–130). Springer Verlag. https://doi.org/10.1007/978-981-10-8228-3_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free