Improving spamdexing detection via a two-stage classification strategy

Guang Gang Geng; Chun Heng Wang; Qiu Dan Li

Conference Proceedings

Improving spamdexing detection via a two-stage classification strategy

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 4993 LNCS 356-364

DOI: 10.1007/978-3-540-68636-1_34

4Citations

9Readers

Get full text

Abstract

Spamdexing is any of various methods to manipulate the relevancy or prominence of resources indexed by a search engine, usually in a manner inconsistent with the purpose of the indexing system. Combating Spamdexing has become one of the top challenges for web search. Machine learning based methods have shown their superiority for being easy to adapt to newly developed spam techniques. In this paper, we propose a two-stage classification strategy to detect web spam, which is based on the predicted spamicity of learning algorithms and hyperlink propagation. Preliminary experiments on standard WEBSPAM-UK2006 benchmark show that the two-stage strategy is reasonable and effective. © 2008 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Geng, G. G., Wang, C. H., & Li, Q. D. (2008). Improving spamdexing detection via a two-stage classification strategy. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4993 LNCS, pp. 356–364). https://doi.org/10.1007/978-3-540-68636-1_34

Improving spamdexing detection via a two-stage classification strategy

Abstract

Cite

Register to see more suggestions