Abstract
This paper presents a method for inducing the parts of speech of a language and part-of-speech labels for individual words from a large text corpus. Vector representations for the part-of-speech of a word are formed from entries of its near lexical neighbors. A dimensionality reduction creates a space representing the syntactic categories of unambiguous words. A neural net trained on these spatial representations classifies individual contexts of occurrence of ambiguous words. The method classifies both ambiguous and unambiguous words correctly with high accuracy.
Cite
CITATION STYLE
Schutze, H. (1993). Part - Of - Speech induction from scratch. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1993-June, pp. 251–258). Association for Computational Linguistics (ACL). https://doi.org/10.3115/981574.981608
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.