Abstract
As it gets easier to add information to the web via html pages, wikis, blogs, and other documents, it gets tougher to distinguish accurate or trustworthy information from inaccurate or untrustworthy information. Moreover, apart from inaccurate or untrustworthy information, we also need to anticipate web spam - where spammers publish false facts and scams to deliberately mislead users. Creating an effective spam detection method is a challenge. In this paper, we use the notion of content trust for spam detection, and regard it as a ranking problem. Evidence is utilized to define the feature of spam web pages, and machine learning techniques are employed to combine the evidence to create a highly efficient and reasonably-accurate spam detection algorithm. Experiments on real web data are carried out, which show the proposed method performs very well in practice. © 2007 International Federation for Information Processing.
Cite
CITATION STYLE
Wang, W., & Zeng, G. (2007). Content trust model for detecting web spam. In IFIP International Federation for Information Processing (Vol. 238, pp. 139–152). https://doi.org/10.1007/978-0-387-73655-6_10
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.