Abstract
Word similarity (WS) is a fundamental and critical task in natural language processing. Existing approaches to WS are mainly to calculate the similarity or relatedness of word pairs based on word embedding obtained by massive and high-quality corpus. However, it may suffer from poor performance for insufficient corpus in some specific fields, and cannot capture rich semantic and sentimental information. To address these above problems, we propose an enhancing embedding-based word similarity evaluation with character-word concepts and synonyms knowledge, namely EWS-CS model, which can provide extra semantic information to enhance word similarity evaluation. The core of our approach contains knowledge encoder and word encoder. In knowledge encoder, we incorporate the semantic knowledge extracted from knowledge resources, including character-word concepts, synonyms and sentiment lexicons, to obtain knowledge representation. Word encoder is to learn enhancing embedding-based word representation from pre-trained model and knowledge representation based on similarity task. Finally, compared with baseline models, the experiments on four similarity evaluation datasets validate the effectiveness of our EWS-CS model in WS task.
Author supplied keywords
Cite
CITATION STYLE
Yin, F., Wang, Y., Liu, J., & Ji, M. (2020). Enhancing embedding-based Chinese word similarity evaluation with concepts and synonyms knowledge. CMES - Computer Modeling in Engineering and Sciences, 124(2), 747–764. https://doi.org/10.32604/cmes.2020.010579
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.