SanskritTagger: A stochastic lexical and POS tagger for sanskrit

Oliver Hellwig

Conference Proceedings

SanskritTagger: A stochastic lexical and POS tagger for sanskrit

Hellwig O

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5402 266-277

DOI: 10.1007/978-3-642-00155-0_11

24Citations

5Readers

Get full text

Abstract

Sanskrit Tagger is a stochastic tagger for unpreprocessed Sanskrit text. The tagger tokenises text and performs part-of-speech tagging using a Markov model. Parameters for these processes are estimated from a manually annotated corpus that currently comprises approximately1,500,000 words. This article sketches the tagging process, reports the results of tagging a few short passages of Sanskrit text and describes further improvements of the program.

Cite

CITATION STYLE

APA

Hellwig, O. (2009). SanskritTagger: A stochastic lexical and POS tagger for sanskrit. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5402, pp. 266–277). https://doi.org/10.1007/978-3-642-00155-0_11

SanskritTagger: A stochastic lexical and POS tagger for sanskrit

Abstract

Cite

Register to see more suggestions