In this paper, we introduce a new NLP task similar to word expansion task or word similarity task, which can discover words sharing the same semantic components (feature sub-space) with seed words. We also propose a Feature Extraction method based on Word Embeddings for this problem. We train word embeddings using state-of-the-art methods like word2vec and models supplied by Stanford NLP Group. Prior Statistical Knowledge and Negative Sampling are proposed and utilized to help extract the Feature Sub-Space. We evaluate our model on WordNet synonym dictionary dataset and compare it to word2vec on synonymy mining and word similarity computing task, showing that our method outperforms other models or methods and can significantly help improve language understanding.
CITATION STYLE
Zhang, W., Xu, W., Chen, G., & Guo, J. (2014). A feature extraction method based on word embedding for word similarity computing. In Communications in Computer and Information Science (Vol. 496, pp. 160–167). Springer Verlag. https://doi.org/10.1007/978-3-662-45924-9_15
Mendeley helps you to discover research relevant for your work.