This paper addresses the design of effective spam filters using combined Naïve Bayes classifiers. First, we describe different tokenization methods that allow us to extract valuable features from e-mails. These methods are used to create training sets for individual Bayesian classifiers, because different feature extraction methods ensure the desirable diversity of the classifier ensemble. Because adequate analytical methods of ensemble evaluation are lacking, the most valuable and diverse committees are chosen through computer experiments carried out on our own spam dataset. Then a number of well-known fusion methods using class labels and class supports are compared to establish the final proposition. © 2013 Springer-Verlag Berlin Heidelberg.
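The general idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the tokenizers, dataset, or fusion rules, so the three tokenizers, the toy training messages, and the two fusion rules (majority voting on class labels, averaging of class supports) are assumptions chosen only for illustration.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical tokenizers (assumed for illustration): each gives a different
# feature view of the same message, which is what makes the ensemble diverse.
def word_tokens(text):
    return re.findall(r"[a-z0-9]+", text.lower())

def word_bigrams(text):
    words = word_tokens(text)
    return [" ".join(pair) for pair in zip(words, words[1:])]

def char_trigrams(text):
    s = re.sub(r"\s+", " ", text.lower())
    return [s[i:i + 3] for i in range(len(s) - 2)]

class NaiveBayes:
    """Multinomial Naive Bayes with Laplace smoothing over one tokenizer."""

    def __init__(self, tokenizer):
        self.tokenizer = tokenizer

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.token_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.token_counts[label].update(self.tokenizer(text))
        self.vocab = set().union(*self.token_counts.values())
        return self

    def supports(self, text):
        """Return normalized class supports as {class: posterior estimate}."""
        n_docs = sum(self.doc_counts.values())
        log_post = {}
        for c in self.classes:
            total = sum(self.token_counts[c].values())
            lp = math.log(self.doc_counts[c] / n_docs)
            for tok in self.tokenizer(text):
                if tok in self.vocab:  # tokens never seen in training are skipped
                    count = self.token_counts[c][tok]
                    lp += math.log((count + 1) / (total + len(self.vocab)))
            log_post[c] = lp
        # Normalize in a numerically stable way.
        peak = max(log_post.values())
        unnorm = {c: math.exp(v - peak) for c, v in log_post.items()}
        z = sum(unnorm.values())
        return {c: v / z for c, v in unnorm.items()}

    def predict(self, text):
        s = self.supports(text)
        return max(s, key=s.get)

def majority_vote(classifiers, text):
    """Fusion on class labels: each classifier casts one vote."""
    votes = Counter(clf.predict(text) for clf in classifiers)
    return votes.most_common(1)[0][0]

def mean_support(classifiers, text):
    """Fusion on class supports: average the posteriors, pick the largest."""
    totals = defaultdict(float)
    for clf in classifiers:
        for c, p in clf.supports(text).items():
            totals[c] += p / len(classifiers)
    return max(totals, key=totals.get)

# Toy training data (assumed; the paper uses the authors' own spam dataset).
train_texts = [
    "win free money now",
    "cheap pills free offer",
    "meeting agenda for monday",
    "please review the attached report",
]
train_labels = ["spam", "spam", "ham", "ham"]

ensemble = [
    NaiveBayes(tok).fit(train_texts, train_labels)
    for tok in (word_tokens, word_bigrams, char_trigrams)
]

for message in ("free money offer", "monday meeting report"):
    print(message, majority_vote(ensemble, message), mean_support(ensemble, message))
```

An odd number of classifiers is used so that majority voting cannot tie on a two-class problem. Averaging supports, by contrast, lets a confident classifier outweigh uncertain ones, which is one reason the two kinds of fusion can disagree and are worth comparing experimentally.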
Wrótniak, K., & Woźniak, M. (2013). Combined Bayesian classifiers applied to spam filtering problem. In Advances in Intelligent Systems and Computing (Vol. 189 AISC, pp. 253–260). Springer Verlag. https://doi.org/10.1007/978-3-642-33018-6_26