Application of word embedding and machine learning in detecting phishing websites

Rao, R.S.; Umarekar, A.; Pais, A.R.

Application of word embedding and machine learning in detecting phishing websites

dc.contributor.author	Rao, R.S.
dc.contributor.author	Umarekar, A.
dc.contributor.author	Pais, A.R.
dc.date.accessioned	2026-02-04T12:28:39Z
dc.date.issued	2022
dc.description.abstract	Phishing is an attack whose aim is to gain personal information such as passwords, credit card details etc. from online users by deceiving them through fake websites, emails or any legitimate internet service. There exists many techniques to detect phishing sites such as third-party based techniques, source code based methods and URL based methods but still users are getting trapped into revealing their sensitive information. In this paper, we propose a new technique which detects phishing sites with word embeddings using plain text and domain specific text extracted from the source code. We applied various word embedding for the evaluation of our model using ensemble and multimodal approaches. From the experimental evaluation, we observed that multimodal with domain specific text achieved a significant accuracy of 99.34% with TPR of 99.59%, FPR of 0.93%, and MCC of 98.68% © 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
dc.identifier.citation	Telecommunication Systems, 2022, 79, 1, pp. 33-45
dc.identifier.issn	10184864
dc.identifier.uri	https://doi.org/10.1007/s11235-021-00850-6
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/22857
dc.publisher	Springer
dc.subject	Codes (symbols)
dc.subject	Computer crime
dc.subject	Embeddings
dc.subject	Fake detection
dc.subject	Machine learning
dc.subject	Websites
dc.subject	Anti-phishing
dc.subject	Domain specific
dc.subject	Hostname
dc.subject	Phishing
dc.subject	Phishing websites
dc.subject	Random forests
dc.subject	Source codes
dc.subject	TF-IDF
dc.subject	URL
dc.subject	Decision trees
dc.title	Application of word embedding and machine learning in detecting phishing websites

Collections

Journal Articles

Application of word embedding and machine learning in detecting phishing websites

Files

Collections