We introduce AWE, a new Chinese word embedding method that enhances Mikolov's CBOW with an attention mechanism. Considering the shortcomings of existing word representation methods, we improve CBOW in two respects. First, whereas CBOW obtains the context vector by simply averaging the representations of the surrounding words, our AWE model aligns the surrounding words with the central word through a global attention mechanism and a self-attention mechanism. Second, CBOW is a bag-of-words model that ignores the order of the surrounding words, so we further enhance AWE with position encoding and propose P&AWE. We design both qualitative and quantitative experiments to analyze the effectiveness of the models. Results indicate that the AWE models substantially outperform CBOW and achieve state-of-the-art performance on the word similarity task. Finally, we further verify the AWE models through attention visualization and case analysis.
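The core difference between CBOW's averaged context vector and an attention-weighted one can be sketched as follows. This is a minimal illustration, not the authors' implementation: a plain dot-product alignment with the central word stands in for the paper's global-attention scoring, and the function and variable names are our own.

```python
import numpy as np

rng = np.random.default_rng(0)

def cbow_context(context_vecs):
    # CBOW: context vector is the plain average of surrounding-word vectors,
    # so every surrounding word contributes equally regardless of relevance.
    return context_vecs.mean(axis=0)

def attention_context(context_vecs, center_vec):
    # AWE-style sketch: weight each surrounding word by its alignment with
    # the central word, via a softmax over dot-product scores (a stand-in
    # for the paper's attention scoring function).
    scores = context_vecs @ center_vec           # (2c,) alignment scores
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                     # softmax attention weights
    return weights @ context_vecs                # attention-weighted sum

context = rng.normal(size=(4, 8))  # 4 surrounding words, embedding dim 8
center = rng.normal(size=8)        # central-word vector

avg_vec = cbow_context(context)
att_vec = attention_context(context, center)
```

Both functions return a context vector of the same dimensionality; the attention variant simply replaces the uniform weights 1/2c with learned, center-word-dependent weights.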
Citation:
Ren, X., Zhang, L., Ye, W., Hua, H., & Zhang, S. (2018). Attention enhanced chinese word embeddings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11139 LNCS, pp. 154–165). Springer Verlag. https://doi.org/10.1007/978-3-030-01418-6_16