Personalized Classification of Non-Spam Emails Using Machine Learning Techniques

Dinendra, Harsha; Rajapakse, Chathura; Asanka, P. P. G. Dinesh

Please use this identifier to cite or link to this item: http://repository.kln.ac.lk/handle/123456789/25424

Title:	Personalized Classification of Non-Spam Emails Using Machine Learning Techniques
Authors:	Dinendra, Harsha Rajapakse, Chathura Asanka, P. P. G. Dinesh
Keywords:	logistic regression, non-spam email classification, Random Forest algorithms, supervised learning, Support Vector Machines
Issue Date:	2022
Publisher:	Department of Industrial Management, Faculty of Science, University of Kelaniya Sri Lanka
Citation:	Dinendra Harsha; Rajapakse Chathura; Asanka P. P. G. Dinesh (2022), Personalized Classification of Non-Spam Emails Using Machine Learning Techniques, International Research Conference on Smart Computing and Systems Engineering (SCSE 2022), Department of Industrial Management, Faculty of Science, University of Kelaniya Sri Lanka. 171-177.
Abstract:	With the advent of computer networks and communications, emails have become one of the most widely accepted communication means, which is faster, more reliable, cheaper, and accessible from anywhere. Due to the increased use of email communications, day-to-day computer users; particularly corporate users, find it cumbersome to filter the most important and urgent emails out of the large number of emails they receive on a given business day. Enterprise email systems are able to automatically identify spam emails but still, there are many non-urgent and unimportant emails among such non-spam emails which cannot be filtered by conventional spam filter programs. Though it may be feasible to set up some static rules and categorize some of the e-mails, the practicality and sustainability of such rules are questionable due to the magnitude of such rules, and the validity period as such rules may become redundant after some time. Thus, it is desired to have an email filtering system for non-spam emails to filter unimportant emails, based on the user’s past behaviour. Despite the availability of research on identifying spam e-mails in the area of further classifying the non-spam e-mails, is lacking. The purpose of this research is to provide a machine learning-based solution to classify non-spam e-mails considering the importance of such e-mails. As part of the research, several machine learning models have been developed and trained using non-spam e-mails, based on the personal mailbox of the first author of this research. The results showed a significant accuracy, particularly with a decision tree, random forests and deep neural network algorithms. This paper presents the modelling details and the results obtained accordingly.
URI:	http://repository.kln.ac.lk/handle/123456789/25424
Appears in Collections:	Smart Computing and Systems Engineering - 2022 (SCSE 2022)

Files in This Item:

File	Description	Size	Format
SCSE 2022 26.pdf		117.62 kB	Adobe PDF	View/Open

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets