Chinese HowNet-based multi-factor word similarity algorithm integrated of result modification

5Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we firstly describe a novel approach to calculate the Chinese sememe similarity based on the HowNet hierarchical sememe tree. When we calculate the sememe similarity, we not only take Semantic Distance, Node Depth and Semantic Coincidence Degree into consideration, but also propose two impact factors named Node Environment Dense (NED) and Node Layer Ratio (NLR) to optimize the calculation process. Secondly, quite a few words described by identical concept definition in HowNet should have a certain discrimination according to human perception, so we propose a hybrid modification algorithm integrated of TongYiCi CiLin (hereinafter, CiLin) to deal with this case. Experiment results of the HowNet-based multi-factor similarity hybrid algorithm shows that this approach improves the similarity of independent sememe words and the words having identical concept descriptions in HowNet, while no large bias influence on the similarity of other words. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Wu, B., Yang, J., & He, L. (2012). Chinese HowNet-based multi-factor word similarity algorithm integrated of result modification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7667 LNCS, pp. 256–266). https://doi.org/10.1007/978-3-642-34500-5_31

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free