TrackPhish: A Multi-Embedding Attention-Enhanced 1D CNN Model for Phishing URL Detection

Kondaiah, C.; Pais, A.R.; Rao, R.S.

TrackPhish: A Multi-Embedding Attention-Enhanced 1D CNN Model for Phishing URL Detection

dc.contributor.author	Kondaiah, C.
dc.contributor.author	Pais, A.R.
dc.contributor.author	Rao, R.S.
dc.date.accessioned	2026-02-03T13:20:39Z
dc.date.issued	2025
dc.description.abstract	Phishing attacks are a growing threat to online security, with increasingly sophisticated and frequent tactics. This rise in cyber threats underscores the need for advanced detection methods. While the Internet is crucial for modern communication and commerce, it also exposes users to risks such as phishing, spamming, malware, and performance degradation attacks. Among these, malicious URLs, commonly embedded in static links within emails and websites, are a significant challenge in identifying and mitigating these attacks. This study proposes TrackPhish, a novel lightweight application that predicts URL legitimacy without visiting the associated website. The proposed model combines traditional word embeddings (Word2Vec, FastText, GloVe) with transformer models (BERT, RoBERTa, GPT-2) to create a comprehensive feature set fed into a Deep Learning (DL) model for detecting phishing URLs. The integration of these embeddings captures semantic relationships and contextual understanding of the text, generating a robust feature set enhanced by an attention mechanism to choose relevant features. The refined features are then used to train a One-Dimensional Convolutional Neural Network (1D CNN) model for phishing URL detection. The proposed model offers key advantages over existing methods, including independence from third-party features, adaptability for client-side deployment, and target-independent detection. Experimental results demonstrate the model’s effectiveness, achieving 95.41% accuracy with a low false positive rate of 1.44% on our dataset and an impressive 98.55% accuracy on benchmark datasets, outperforming existing baseline models. The proposed model represents a significant advancement over traditional methods, enhancing online security against phishing URLs. © 2005-2012 IEEE.
dc.identifier.citation	IEEE Transactions on Information Forensics and Security, 2025, 20, , pp. 12188-12198
dc.identifier.issn	15566013
dc.identifier.uri	https://doi.org/10.1109/TIFS.2025.3629558
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/20598
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.subject	Deep learning
dc.subject	Economic and social effects
dc.subject	Embeddings
dc.subject	Feature extraction
dc.subject	Malware
dc.subject	Network security
dc.subject	Neural networks
dc.subject	Phishing
dc.subject	Websites
dc.subject	Attention meachnism
dc.subject	CNN models
dc.subject	Convolutional neural network
dc.subject	Neural network model
dc.subject	Phishing URL
dc.subject	Trackphish
dc.subject	Transformer modeling
dc.subject	Word embedding
dc.subject	Semantics
dc.title	TrackPhish: A Multi-Embedding Attention-Enhanced 1D CNN Model for Phishing URL Detection

Collections

Journal Articles

TrackPhish: A Multi-Embedding Attention-Enhanced 1D CNN Model for Phishing URL Detection

Files

Collections