Phrase-based statistical model for Korean morpheme segmentation and POS tagging

7Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we propose a novel phrase-based model for Korean morphological analysis by considering a phrase as the basic processing unit, which generalizes all the other existing processing units. The impetus for using phrases this way is largely motivated by the success of phrase-based statistical machine translation (SMT), which convincingly shows that the larger the processing unit, the better the performance. Experimental results using the SEJONG dataset show that the proposed phrasebased models outperform the morpheme-based models used as baselines. In particular, when combined with the conditional random field (CRF) model, our model leads to statistically significant improvements over the state-of-the-art CRF method.

Cite

CITATION STYLE

APA

Na, S. H., & Kim, Y. K. (2018). Phrase-based statistical model for Korean morpheme segmentation and POS tagging. IEICE Transactions on Information and Systems, E101D(2), 512–522. https://doi.org/10.1587/transinf.2017EDP7085

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free