Abstract
Bootstrapping has a tendency, called semantic drift, to select instances unrelated to the seed instances as the iteration proceeds. We demonstrate the semantic drift of Espresso-style bootstrapping has the same root as the topic drift of Kleinberg's HITS, using a simplified graph-based reformulation of bootstrapping. We confirm that two graph-based algorithms, the von Neumann kernels and the regularized Laplacian, can reduce the effect of semantic drift in the task of word sense disambiguation (WSD) on Senseval-3 English Lexical Sample Task. Proposed algorithms achieve superior performance to Espresso and previous graph-based WSD methods, even though the proposed algorithms have less parameters and are easy to calibrate.
Author supplied keywords
Cite
CITATION STYLE
Komachi, M., Kudo, T., Shimbo, M., & Matsumoto, Y. (2010). Semantic drift in espresso-style bootstrapping: Graph-theoretic analysis and evaluation in word sense disambiguation. Transactions of the Japanese Society for Artificial Intelligence, 25(2), 233–242. https://doi.org/10.1527/tjsai.25.233
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.