Weak semi-Markov CRFs for NP chunking in informal text

16Citations
Citations of this article
93Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper introduces a new annotated corpus based on an existing informal text corpus: the NUS SMS Corpus (Chen and Kan, 2013). The new corpus includes 76,490 noun phrases from 26,500 SMS messages, annotated by university students. We then explored several graphical models, including a novel variant of the semi-Markov conditional random fields (semi-CRF) for the task of noun phrase chunking. We demonstrated through empirical evaluations on the new dataset that the new variant yielded similar accuracy but ran in significantly lower running time compared to the conventional semi-CRF.

Cite

CITATION STYLE

APA

Muis, A. O., & Lu, W. (2016). Weak semi-Markov CRFs for NP chunking in informal text. In 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (pp. 714–719). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/N16-1085

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free