A complex sentence, divided into clauses, can be analyzed more easily than the complex sentence itself. We present here, the task of identification and classification of clauses in Hindi text. To the best of our knowledge, not much work has been done on clause boundary identification for Hindi, which makes this task more important. We have built a rule based system using linguistic cues such as coordinating conjunct, subordinating conjunct etc. Our system gives 91.53% and 80.63% F1-scores for identification and classification for finite clauses respectively, and 60.57% accuracy for non-finite clauses.
CITATION STYLE
Sharma, R., & Paul, S. (2014). A rule based approach for automatic clause boundary detection and classification in Hindi. In Proceedings of the Conference - 5th Workshop on South and Southeast Asian NLP, WSSANLP 2014 - co-located with the 25th International Conference on Computational Linguistics, COLING 2014 (pp. 102–111). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-5514
Mendeley helps you to discover research relevant for your work.