This paper addresses the automatic induction of syntactic categories from unannotated corpora. Previous techniques give good results but cope poorly with ambiguity and rare words. The paper presents an algorithm, context distribution clustering (CDC), that can be naturally extended to handle these problems.
Clark, A. (2000). Inducing syntactic categories by context distribution clustering. In Proceedings of the 4th Conference on Computational Natural Language Learning, CoNLL 2000 and of the 2nd Learning Language in Logic Workshop, LLL 2000 - Held in cooperation with ICGI 2000 (pp. 91–94). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1117601.1117621