Abstract
Nowadays e-mail spam is not a novelty, but it is still an important rising problem with a big economic impact in society. Spammers manage to circumvent current spam filters and harm the communication system by consuming several resources, damaging the reliability of e-mail as a communication instrument and tricking recipients to react to spam messages. Consequently, spam filtering poses a special problem in text categorization, of which the defining characteristic is that filters face an active adversary, which constantly attempts to evade filtering. In this paper, we present a novel approach to spam filtering based on theminimum description length principle. Furthermore, we have conducted an empirical experiment on six public and real non-encoded datasets. The results indicate that the proposed filter is fast to construct, incrementally updateable and clearly outperforms the state-of-the-art spam filters. © The Brazilian Computer Society 2012.
Author supplied keywords
Cite
CITATION STYLE
Almeida, T. A., & Yamakami, A. (2012). Occam’s razor-based spam filter. Journal of Internet Services and Applications, 3(3), 245–253. https://doi.org/10.1007/s13174-012-0067-x
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.