Abstract
This paper presents an ongoing project whose goal is to create a freely available dependency treebank for Persian. The data is taken from the Bijankhan corpus, which is already annotated for parts of speech, and a syntactic dependency annotation based on the Stanford Typed Dependencies is added through a bootstrapping procedure involving the open-source dependency parser MaltParser. We report preliminary parsing experiments with promising results after training the parser on a manually annotated seed data set of 215 sentences.
Cite
Seraji, M., Megyesi, B., & Nivre, J. (2012). Bootstrapping a Persian Dependency Treebank. Linguistic Issues in Language Technology, 7. https://doi.org/10.33011/lilt.v7i.1297