IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Detecting Phishing URLs With Word Embedding and Deep Learning

Detecting Phishing URLs With Word Embedding and Deep Learning
View Sample PDF
Author(s): Ali Selamat (Universiti Teknologi Malaysia, Malaysia & Hradec Kralove University, Czech Republic), Nguyet Quang Do (Universiti Teknologi Malaysia, Malaysia)and Ondrej Krejcar (Hradec Kralove University, Czech Republic)
Copyright: 2023
Pages: 24
Source title: Perspectives and Considerations on the Evolution of Smart Systems
Source Author(s)/Editor(s): Maki K. Habib (American University in Cairo, Egypt)
DOI: 10.4018/978-1-6684-7684-0.ch011

Purchase

View Detecting Phishing URLs With Word Embedding and Deep Learning on the publisher's website for pricing and purchasing information.

Abstract

The past decade has witnessed the rapid development of natural language processing and machine learning in the phishing detection domain. However, there needs to be more research on word embedding and deep learning for malicious URL classification. Inspired to solve this problem, this chapter aims to examine the application of word embedding and deep learning in extracting features from website URLs. To achieve this, several word embedding techniques, such as Keras, Word2Vec, GloVe, and FastText, were used to learn feature representations of webpage URLs. The obtained feature vectors were fed into a deep-learning model based on CNN-BiGRU for extraction and classification. Two different datasets were used to conduct numerous experiments, while various metrics were utilized to evaluate the phishing detection model's performance. The obtained findings indicated that when combined with deep learning, Keras outperformed other text embedding methods and achieved the best results across all evaluation metrics on both datasets.

Related Content

Babita Srivastava. © 2024. 21 pages.
Sakuntala Rao, Shalini Chandra, Dhrupad Mathur. © 2024. 27 pages.
Satya Sekhar Venkata Gudimetla, Naveen Tirumalaraju. © 2024. 24 pages.
Neeta Baporikar. © 2024. 23 pages.
Shankar Subramanian Subramanian, Amritha Subhayan Krishnan, Arumugam Seetharaman. © 2024. 35 pages.
Charu Banga, Farhan Ujager. © 2024. 24 pages.
Munir Ahmad. © 2024. 27 pages.
Body Bottom