A semi-supervised feature clustering algorithm with application toword sense disambiguation

Zheng Yu Niu; Dong Hong Ji; Chew Lim Tan

Conference ProceedingsOPEN ACCESS

A semi-supervised feature clustering algorithm with application toword sense disambiguation

HLT/EMNLP 2005 - Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (2005) 907-914

DOI: 10.3115/1220575.1220689

4Citations

86Readers

Abstract

In this paper we investigate an application of feature clustering for word sense disambiguation, and propose a semisupervised feature clustering algorithm. Compared with other feature clustering methods (ex. supervised feature clustering), it can infer the distribution of class labels over (unseen) features unavailable in training data (labeled data) by the use of the distribution of class labels over (seen) features available in training data. Thus, it can deal with both seen and unseen features in feature clustering process. Our experimental results show that feature clustering can aggressively reduce the dimensionality of feature space, while still maintaining state of the art sense disambiguation accuracy. Furthermore, when combined with a semi-supervised WSD algorithm, semi-supervised feature clustering outperforms other dimensionality reduction techniques, which indicates that using unlabeled data in learning process helps to improve the performance of feature clustering and sense disambiguation. © 2005 Association for Computational Linguistics.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Niu, Z. Y., Ji, D. H., & Tan, C. L. (2005). A semi-supervised feature clustering algorithm with application toword sense disambiguation. In HLT/EMNLP 2005 - Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 907–914). https://doi.org/10.3115/1220575.1220689

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 36

71%

Researcher 11

22%

Professor / Associate Prof. 2

Lecturer / Post doc 2

Readers' Discipline

Computer Science 43

83%

Linguistics 5

10%

Social Sciences 2

Engineering 2

A semi-supervised feature clustering algorithm with application toword sense disambiguation

Abstract

References Powered by Scopus

Indexing by latent semantic analysis

Divergence Measures Based on the Shannon Entropy

Co-clustering documents and words using bipartite spectral graph partitioning

Cited by Powered by Scopus

Information-maximization clustering based on squared-lossmutual information

Semi-supervised Chinese contextual polarity classification with automatic feature selection

Semi-supervised clustering forword instances and its effect on word sense disambiguation

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline