Regular expression tagger for Kannada parts of speech tagging

K. M. Shiva Kumar; Deepa Gupta

Conference Proceedings

Advances in Intelligent Systems and Computing (2018) 712 121-130

DOI: 10.1007/978-981-10-8228-3_12

Abstract

Part-of-speech (POS) tagging for Indian languages in general, and Kannada in particular, is not a widely explored territory. There have been many attempts at developing a good POS tagger for Kannada, but the morphological complexity of the language makes it a hard nut to crack. Some of the best taggers available for Indian languages combine machine learning or stochastic methods with linguistic knowledge. Though the results achieved using such hybrid methods are good, their practicability for other inflective Indian languages is reduced by their heavy dependence on linguistic knowledge. Even though taggers can achieve very good results when provided with good morphological information, the cost of creating these resources renders such methods impractical. In this paper, we present a regular expression part-of-speech tagger for Kannada. We apply 100 patterns incorporating the TDIL tags for Kannada and test the tagger's accuracy against a manually tagged corpus.
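The general idea of a regular expression tagger can be sketched as follows. This is only an illustration of the technique, not the authors' implementation: the three patterns, the tag labels (`RD_PUNC`, `QT_QTC`, `N_NN`), the default tag, and the function names `regex_tag` and `accuracy` are all assumptions made for the example, and the paper's 100 patterns are not reproduced here.

```python
import re

# Hypothetical (pattern, tag) pairs, tried in order; the first match wins.
# The tag labels are illustrative placeholders, not the paper's pattern set.
PATTERNS = [
    (re.compile(r"^[.,;:!?]+$"), "RD_PUNC"),      # punctuation-only tokens
    (re.compile(r"^\d+([.,]\d+)?$"), "QT_QTC"),   # numerals
    (re.compile(r"^.+ಗಳು$"), "N_NN"),             # tokens ending in the plural suffix -gaḷu
]

# Assumed fallback tag for tokens that match no pattern.
DEFAULT_TAG = "N_NN"


def regex_tag(tokens):
    """Tag each token with the tag of the first pattern it matches."""
    tagged = []
    for token in tokens:
        for pattern, tag in PATTERNS:
            if pattern.match(token):
                tagged.append((token, tag))
                break
        else:
            # No pattern matched, so fall back to the default tag.
            tagged.append((token, DEFAULT_TAG))
    return tagged


def accuracy(predicted, gold):
    """Fraction of tokens whose predicted tag equals the manually assigned tag."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0
    correct = sum(1 for (_, p), (_, g) in zip(predicted, gold) if p == g)
    return correct / len(gold)
```

Under these assumptions, `regex_tag` would be run over a tokenized sentence and `accuracy` would compare its output with a manually tagged version of the same tokens. Pattern order matters in this sketch, since more specific patterns have to come before more general ones.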

Cite

Shiva Kumar, K. M., & Gupta, D. (2018). Regular expression tagger for Kannada parts of speech tagging. In Advances in Intelligent Systems and Computing (Vol. 712, pp. 121–130). Springer Verlag. https://doi.org/10.1007/978-981-10-8228-3_12
