Abstract
Chinese chunking has traditionally been solved by assuming gold standard word segmentation. We find that the accuracies drop drastically when automatic segmentation is used. Inspired by the fact that chunking knowledge can potentially improve segmentation, we explore a joint model that performs segmentation, POStagging and chunking simultaneously. In addition, to address the sparsity of full chunk features, we employ a semi-supervised method to derive chunk cluster features from large-scale automatically-chunked data. Results show the effectiveness of the joint model with semi-supervised features.
Cite
CITATION STYLE
Lyu, C., Zhang, Y., & Ji, D. (2016). Joint word segmentation, POS-Tagging and syntactic chunking. In 30th AAAI Conference on Artificial Intelligence, AAAI 2016 (pp. 3007–3014). AAAI press. https://doi.org/10.1609/aaai.v30i1.10369
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.