SanskritTagger: A stochastic lexical and POS tagger for sanskrit

24Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Sanskrit Tagger is a stochastic tagger for unpreprocessed Sanskrit text. The tagger tokenises text and performs part-of-speech tagging using a Markov model. Parameters for these processes are estimated from a manually annotated corpus that currently comprises approximately1,500,000 words. This article sketches the tagging process, reports the results of tagging a few short passages of Sanskrit text and describes further improvements of the program.

Cite

CITATION STYLE

APA

Hellwig, O. (2009). SanskritTagger: A stochastic lexical and POS tagger for sanskrit. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5402, pp. 266–277). https://doi.org/10.1007/978-3-642-00155-0_11

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free