Comment spam classification in blogs through comment analysis and comment-blog post relationships

Ashwin Rajadesingan; Anand Mahendran

Conference Proceedings

Comment spam classification in blogs through comment analysis and comment-blog post relationships

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7182 LNCS(PART 2) 490-501

DOI: 10.1007/978-3-642-28601-8_41

4Citations

13Readers

Get full text

Abstract

Spamming refers to the process of providing unwanted and irrelevant information to the users. It is a widespread phenomenon that is often noticed in e-mails, instant messages, blogs and forums. In our paper, we consider the problem of spamming in blogs. In blogs, spammers usually target commenting systems which are provided by the authors to facilitate interaction with the readers. Unfortunately, spammers abuse these commenting systems by posting irrelevant and unsolicited content in the form of spam comments. Thus, we propose a novel methodology to classify comments into spam and non-spam using previously-undescribed features including certain blog post-comment relationships. Experiments conducted using our methodology produced a spam detection accuracy of 94.82% with a precision of 96.50% and a recall of 95.80%. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Rajadesingan, A., & Mahendran, A. (2012). Comment spam classification in blogs through comment analysis and comment-blog post relationships. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7182 LNCS, pp. 490–501). https://doi.org/10.1007/978-3-642-28601-8_41

Comment spam classification in blogs through comment analysis and comment-blog post relationships

Abstract

Author supplied keywords

Cite

Register to see more suggestions