Email header feature extraction using adaptive and collaborative approach for email classification

ISSN: 22783075
3Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

Email Header is footprint of an Email that can be used to examine an Email as HAM or SPAM. Email classification in this research is done on the basis of header features thus by keeping the content privacy of the sender intact [1]. Header features are , email header fields like sender, to, cc, bcc, subject. This research tries to improve the accuracy of the classification by extracting more number of header features. Email Subject is further deeply examined for objectionable keywords for rule matching and rule generation. In our study, we implement an adaptive and collaborative approach by using machine learning and cluster computing for fast classification of Emails as SPAM or HAM. Adaptive approach is to generate new rules for classification and cluster approach is to use parallel computing power for increasing computing speed. New rules are only generated if features extracted from email header do not match the existing rules. Spam Assassin [2][3] is the main dataset used for testing. Collaborative approach creates a parallel environment where multiple antispam methods and divided test corpora are used as input. The false positive and false negative percentage are recorded and accuracy is calculated. Weka Data Mining Software is used to apply the anti-spam methods (available at http://www. cs.waikato.ac.nz /~ml/weka/) [4]

Cite

CITATION STYLE

APA

Rajput, A. S., Sohal, J. S., & Athavale, V. (2019). Email header feature extraction using adaptive and collaborative approach for email classification. International Journal of Innovative Technology and Exploring Engineering, 8(7), 158–164.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free