A privacy-preserving explainable AI framework for phishing URL detection
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Faculty of Graduate Studies, University of Kelaniya, Sri Lanka.
Abstract
In the era of artificial intelligence (AI), achieving a balance between user privacy and the interpretability of Al systems remains challenging. This paper presents a novel framework that bridges explainable artificial intelligence (XAI) with privacy-preserving (PP) methods to enhance security and trust in phishing URL detection. Phishing continues to be a major cyber threat, with the anti-phishing working group (APWG) reporting 877,536 incidents in the second quarter of 2024. The framework presents PP techniques, such as differential privacy, federated learning, and homomorphic encryption, and the techniques made available through XAI, such as SHapley Additive exPlanations (SHAP). This strategy enables the protection of these sensitive details (i.e., URLs) from exposure in both model training and prediction, while still ensuring that the inner workings of machine learning (ML) methods remain interpretable to the users. It makes use of the University of California - Irvine (UCI) Phishing Websites Dataset, which consists of 11,055 records of authentic and phishing URLs and includes 30 attributes such as IP address inclusion, URL length, and Secure Sockets Layer (SSL) certificate status. A range of ML models such as XGBoost, Random Forest, GBDT and logistic regression were developed to classify the URLs. The model accuracy, precision, and recall metrics relative to one another, were used to determine the performance level of the models. SHAP explains what, while addressing levels of explanations both types as in the global or individual level. Visual aids including force graphs and SHAP summaries would assist cybersecurity professionals in making sense of these decisions, which would ultimately enhance the interpretability of the model. This proposed framework demonstrates how the growing phishing attacks can be mitigated within reasonable limits while maintaining transparency by integrating PP techniques and XAI thereby revolutionizing the cybersecurity space.
Description
Citation
Madushanka, M. K. P., & Liyanage, S. R. (2024). A privacy-preserving explainable AI framework for phishing URL detection. International Postgraduate Research Conference (IPRC) - 2024. Faculty of Graduate Studies, University of Kelaniya, Sri Lanka. (p. 39).