To automatically extract Chinese collocations and build a large-scale collocation bank, we are developing a one-million-word Chinese shallow-parsed treebank. The treebank can be used both as a training set for our shallow parser and as the processed data from which collocations are extracted. This paper presents several issues related to this ongoing project, including our definition of shallow parsing for Chinese collocation extraction, guideline preparation, and quality control.
CITATION STYLE
Li, B., Qin, L., & Yin, L. (2003). Building a Chinese shallow parsed treebank for collocation extraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2588, pp. 402–405). Springer Verlag. https://doi.org/10.1007/3-540-36456-0_41