A counting-based method for massive spam mail classification

Hao Luo; Binxing Fang; Xiaochun Yun

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 3903 LNCS 45-56

DOI: 10.1007/11689522_5

Abstract

Previous research has explored the effectiveness of machine learning classifiers for filtering spam email, and the results show that such classifiers can achieve high precision and recall. However, because these methods are probabilistic, they cannot avoid misclassifying some normal mail as spam. An evident difference between spam and normal mail is that a single spam message is delivered to many users, while most normal messages have only a single recipient. Based on this observation, this paper presents a server-based massive mail classifier that combines a counting-based classifier, a bitmap-based white list (BWL), and a grey list to filter massive spam mail. Results show that a spam classifier using our method filters spam with a very low false-positive rate and preserves performance while coping with large volumes of spam mail. With an optimized parameter configuration, our method achieves a precision of 100% and a recall of 75.3% in spam mail classification. © Springer-Verlag Berlin Heidelberg 2006.
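
The counting idea in the abstract (one spam message reaches many recipients, while normal mail usually reaches one) can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `CountingClassifier`, the SHA-256 body fingerprint, the `threshold` value, and the plain Python set standing in for the white list are all assumptions made for illustration. The abstract does not describe how the bitmap-based white list or the grey list work, so neither is modelled here.

```python
import hashlib
from collections import defaultdict


class CountingClassifier:
    """Flags a message as massive mail once its body has reached
    a given number of distinct recipients (illustrative only)."""

    def __init__(self, threshold=3, whitelist=None):
        self.threshold = threshold              # hypothetical cutoff, not from the paper
        self.whitelist = set(whitelist or ())   # plain set, not the paper's bitmap structure
        self.recipients = defaultdict(set)      # body fingerprint -> distinct recipients

    @staticmethod
    def fingerprint(body):
        # Assumption: identical bodies are detected by hashing the
        # lowercased, whitespace-normalized text.
        normalized = " ".join(body.lower().split())
        return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

    def observe(self, sender, recipient, body):
        """Record one delivery; return True if the body now counts as massive mail."""
        if sender in self.whitelist:
            return False  # whitelisted senders are never counted
        seen = self.recipients[self.fingerprint(body)]
        seen.add(recipient)
        return len(seen) >= self.threshold


if __name__ == "__main__":
    clf = CountingClassifier(threshold=3, whitelist={"news@example.org"})
    for rcpt in ("a@example.com", "b@example.com", "c@example.com"):
        print(rcpt, clf.observe("bulk@example.net", rcpt, "Buy now!"))
```

In this sketch a message is flagged only after the same body has been seen for `threshold` distinct recipients, so mail sent to a single recipient is never flagged. That is consistent with the abstract's emphasis on avoiding false positives, at the cost of missing spam that varies its content or is seen too few times.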

Cite

Luo, H., Fang, B., & Yun, X. (2006). A counting-based method for massive spam mail classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3903 LNCS, pp. 45–56). Springer Verlag. https://doi.org/10.1007/11689522_5
