With the evolution of “social” world, people produce a lot of data. Data is being produced everywhere without the inherent knowledge of the people. And, with the incremental usage of social media and e-commerce sites etc., a user produces and consumes a lot of data. The ‘data’ referred to here is not the bandwidth but the text. This text can be in the form of comments, reviews, emails, names, identities, birth dates, offers, claims etc. The problem here is the integrity of data and where its end point is and the sanity. Integrity, although solved by cryptography algorithms, the sanity is always a question mark. Checking if a data is clean is the most crucial part or else a lot of space and valuable resources are wasted. In this paper, we provide a novel way of using Natural Language Processing and Multinomial Naive Bayes algorithm to filter spam before insertion. The model filters spam with an accuracy of about 96 percent.
CITATION STYLE
Eshwar, S., & Lavanya, K. (2019). MONO-spam: An intelligent spam detector based on natural language processing. International Journal of Recent Technology and Engineering, 7(6), 449–457.
Mendeley helps you to discover research relevant for your work.