Exploring word similarity to improve Chinese Personal Name Disambiguation

Xia Yang; Peng Jin; Wei Xiang

Conference Proceedings

Exploring word similarity to improve Chinese Personal Name Disambiguation

Proceedings - 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011 (2011) 3 197-200

DOI: 10.1109/WI-IAT.2011.90

8Citations

8Readers

Get full text

Abstract

This paper presents an approach to the Chinese Personal Name Disambiguation (PND). The key to clustering is the similarity measure of context, which depends on the features selection and representation and calculation method. First HIT Tongyici Cilin (Extended) is introduced to Chinese PND to enhance the clustering effect. Exploration about more word similarity is also performed to alleviate the data sparseness. In this system, a HAC (Hierarchical Agglomerative Clustering) algorithm is adopted to cluster the mentions referring to a same person with features extracted from documents. The results show that the word similarity information is very helpful to improve the system's performance. © 2011 IEEE.

Author supplied keywords

Cite

CITATION STYLE

APA

Yang, X., Jin, P., & Xiang, W. (2011). Exploring word similarity to improve Chinese Personal Name Disambiguation. In Proceedings - 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IAT 2011 (Vol. 3, pp. 197–200). https://doi.org/10.1109/WI-IAT.2011.90

Exploring word similarity to improve Chinese Personal Name Disambiguation

Abstract

Author supplied keywords

Cite

Register to see more suggestions