Punctuation as implicit annotations for Chinese word segmentation

125Citations
Citations of this article
146Readers
Mendeley users who have this article in their library.

Abstract

We present a Chinese word segmentation model learned from punctuation marks which are perfect word delimiters. The learning is aided by a manually segmented corpus. Our method is considerably more effective than previous methods in unknown word recognition. This is a step toward addressing one of the toughest problems in Chinese word segmentation. © 2009 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Li, Z., & Sun, M. (2009). Punctuation as implicit annotations for Chinese word segmentation. Computational Linguistics, 35(4), 505–512. https://doi.org/10.1162/coli.2009.35.4.35403

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free