A feature extraction method based on word embedding for word similarity computing

Weitai Zhang; Weiran Xu; Guang Chen; Jun Guo

Conference Proceedings

A feature extraction method based on word embedding for word similarity computing

Communications in Computer and Information Science (2014) 496 160-167

DOI: 10.1007/978-3-662-45924-9_15

9Citations

21Readers

Get full text

Abstract

In this paper, we introduce a new NLP task similar to word expansion task or word similarity task, which can discover words sharing the same semantic components (feature sub-space) with seed words. We also propose a Feature Extraction method based on Word Embeddings for this problem. We train word embeddings using state-of-the-art methods like word2vec and models supplied by Stanford NLP Group. Prior Statistical Knowledge and Negative Sampling are proposed and utilized to help extract the Feature Sub-Space. We evaluate our model on WordNet synonym dictionary dataset and compare it to word2vec on synonymy mining and word similarity computing task, showing that our method outperforms other models or methods and can significantly help improve language understanding.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhang, W., Xu, W., Chen, G., & Guo, J. (2014). A feature extraction method based on word embedding for word similarity computing. In Communications in Computer and Information Science (Vol. 496, pp. 160–167). Springer Verlag. https://doi.org/10.1007/978-3-662-45924-9_15

A feature extraction method based on word embedding for word similarity computing

Abstract

Author supplied keywords

Cite

Register to see more suggestions