EMAIL SPAM DETECTION USING SUPPORT VECTOR MACHINES AND NATURAL LANGUAGE PROCESSING

B .Kamatchy, B .Kamatchy and N. Kalaichelvi, N. Kalaichelvi and S.MuthuKumaran, S.MuthuKumaran and S.Prathiba, S.Prathiba (2026) EMAIL SPAM DETECTION USING SUPPORT VECTOR MACHINES AND NATURAL LANGUAGE PROCESSING. In: Fourth International Conference on Cyber Security and Generative Artificial Intelligence, 13.03.2026, SRM, Chennai.

[thumbnail of muthu_srmspammailpaper.pdf] Text
muthu_srmspammailpaper.pdf - Published Version

Download (4MB)

Abstract

The growing use of email communication has led to the massive influx of unwanted and viral communications otherwise known as spam. Spamming must be detected effectively to make sure that an email is not lost and that the email is secure. In this paper, an automated email spam solution has been introduced and it is based on Natural Language Processing (NLP) and a Multi- Kernel Support Vector Model (MK-SVM) model. Term Frequency Inverse Document Frequency (TF-IDF) technique is used to convert email text into numerical features and Support Vector Machines with Linear, Radial Basis Function (RBF), Polynomial, and Sigmoid kernels are used to do the classification. Each of the kernels is tested based on several metrics which include accuracy, precision, recall, specificity and errors rate. The experimental results indicate that the Linear and Sigmoid kernel SVMs are the most accurate in classification with a minimum error of 0.011, as compared to RBF and Polynomial kernels. These results show that the simplicity of kernel functions is better applied to large textual data of high dimensions and it would be the most appropriate to email spam filters based on NLP.

Item Type: Conference or Workshop Item (Paper)
Subjects: Computer Applications > Artificial Intelligence
Domains: Computer Science
Depositing User: Mr IR Admin
Date Deposited: 07 May 2026 16:29
Last Modified: 07 May 2026 16:29
URI: https://ir.vistas.ac.in/id/eprint/14010

Actions (login required)

View Item
View Item