This paper describes a similarity-based technique that estimates the part-of-speech tags and morpho-syntactic relations of Chinese compound words before the words are fed into a tagger. The technique relies on a set of features drawn from Chinese morphemes, together with a set of collocation markers that provide hints about the syntactic categories of the compound words. It is trained on a database of more than 53,500 disyllabic compound words. Experimental results show that the tagger with the technique outperforms its counterpart without the technique. © 2013 Springer-Verlag.
CITATION STYLE
Chan, S. W. K., & Chong, M. M. C. (2013). Predicting part-of-speech tags and morpho-syntactic relations using similarity-based technique. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7978 LNAI, pp. 71–82). https://doi.org/10.1007/978-3-642-39593-2_6