Identification of reduplicated multiword expressions using CRF

6Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper deals with the identification of Reduplicated Multiword Expressions (RMWEs) which is important for any natural language applications like Machine Translation, Information Retrieval etc. In the present task, reduplicated MWEs have been identified in Manipuri language texts using CRF tool. Manipuri is highly agglutinative in nature and reduplication is quite high in this language. The important features selected for running the CRF tool include stem words, number of suffixes, number of prefixes, prefixes in the word, suffixes in the word, Part Of Speech (POS) of the surrounding words, surrounding stem words, length of the word, word frequency and digit feature. Experimental results show the effectiveness of the proposed approach with the overall average Recall, Precision and F-Score values of 92.91%, 91.90% and 92.40% respectively. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Nongmeikapam, K., Laishram, D., Singh, N. B., Chanu, N. M., & Bandyopadhyay, S. (2011). Identification of reduplicated multiword expressions using CRF. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6608 LNCS, pp. 41–51). https://doi.org/10.1007/978-3-642-19400-9_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free