Benchmarking hybrid architectures of BERT-based embeddings with CNN and LSTM for real-time phishing URL detection

dc.contributor.authorMaduwanthi, W. V. C.
dc.contributor.authorTharaka, Y. M. S.
dc.contributor.authorHewapathirana, I. U.
dc.date.accessioned2025-10-29T13:25:19Z
dc.date.issued2025
dc.description.abstractPhishing and malicious URL attacks are escalating threats in the digital landscape, demanding real-time detection. Current research is constrained by feature-engineered datasets and limited model combinations. This study introduces a novel, exhaustive investigation into hybrid deep learning architectures for malicious URL classification, using raw URL strings without pre-extracted features. We combine pre-trained transformers (BERT, URLBERT, DomURLs-BERT) with Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks to capture semantic and sequential/spatial patterns. Six models are evaluated on a large benchmark dataset. Comparative analysis reveals performance differentials, identifying the most effective architecture based on accuracy and F1 score. This work provides a comprehensive benchmarking framework and demonstrates the promise of BERT-based string-level processing for real-world phishing detection.
dc.identifier.citationMaduwanthi, W. V. C., Tharaka, Y. M. S. & Hewapathirana, I. U.(2025). Benchmarking hybrid architectures of BERT-based embeddings with CNN and LSTM for real-time phishing URL detection, International Conference on Applied and Pure Sciences, 2025. 330-335
dc.identifier.urihttp://repository.kln.ac.lk/handle/123456789/30149
dc.language.isoen
dc.publisherInternational Conference on Applied and Pure Sciences, 2025
dc.subjectCybersecurity
dc.subjectHybrid Deep Learning
dc.subjectMalicious URL Classification
dc.subjectPhishing Detection
dc.subjectTransformer Models
dc.titleBenchmarking hybrid architectures of BERT-based embeddings with CNN and LSTM for real-time phishing URL detection
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Maduwanthi et al_2025_2.pdf
Size:
319.3 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: