A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods

Most of the sophisticated attacks in the modern age of cybercrime are based, among other things, on specialized phishing campaigns. A challenge in identifying phishing campaigns is defining a classification of patterns that can be generalized and used in different areas and campaigns of a different...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yonghui Tang, Fei Wu
Format:	Article
Language:	English
Published:	Wiley 2022-01-01
Series:	Applied Bionics and Biomechanics
Online Access:	http://dx.doi.org/10.1155/2022/5036026
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832565734069239808
author	Yonghui Tang Fei Wu
author_facet	Yonghui Tang Fei Wu
author_sort	Yonghui Tang
collection	DOAJ
description	Most of the sophisticated attacks in the modern age of cybercrime are based, among other things, on specialized phishing campaigns. A challenge in identifying phishing campaigns is defining a classification of patterns that can be generalized and used in different areas and campaigns of a different nature. Although efforts have been made to establish a general labeling scheme in their classification, there is still limited data labeled in such a format. The usual approaches are based on feature engineering to correctly identify phishing campaigns, exporting lexical, syntactic, and semantic features, e.g., previous phrases. In this context, the most recent approaches have taken advantage of modern neural network architectures to record hidden information at the phrase and text levels, e.g., Long Short-Term Memory (LSTM) and Convolutional Neural Networks (CNNs). However, these models lose semantic information related to the specific problem, resulting in a variation in their performance, depending on the different data sets and the corresponding standards used for labeling. In this paper, we propose to extend word embeddings with word vectors that indicate the semantic similarity of each word with each phishing campaigns template tag. These embedded keywords are calculated based on semantic subfields corresponding to each phishing campaign tag, constructed based on the automatic extraction of keywords representing these tags. Combining general word integrations with vectors is calculated based on word similarity using a set of sequential Kalman filters, which can then power any neural architecture such as LSTM or CNN to predict each phishing campaign. Our experiments use a data indicator to evaluate our approach and achieve remarkable results that reinforce the state-of-the-art.
format	Article
id	doaj-art-a75f76c826ee469487378044c50bea34
institution	Kabale University
issn	1754-2103
language	English
publishDate	2022-01-01
publisher	Wiley
record_format	Article
series	Applied Bionics and Biomechanics
spelling	doaj-art-a75f76c826ee469487378044c50bea342025-02-03T01:06:50ZengWileyApplied Bionics and Biomechanics1754-21032022-01-01202210.1155/2022/5036026A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition MethodsYonghui Tang0Fei Wu1Shaoyang UniversityHunan Institute of EngineeringMost of the sophisticated attacks in the modern age of cybercrime are based, among other things, on specialized phishing campaigns. A challenge in identifying phishing campaigns is defining a classification of patterns that can be generalized and used in different areas and campaigns of a different nature. Although efforts have been made to establish a general labeling scheme in their classification, there is still limited data labeled in such a format. The usual approaches are based on feature engineering to correctly identify phishing campaigns, exporting lexical, syntactic, and semantic features, e.g., previous phrases. In this context, the most recent approaches have taken advantage of modern neural network architectures to record hidden information at the phrase and text levels, e.g., Long Short-Term Memory (LSTM) and Convolutional Neural Networks (CNNs). However, these models lose semantic information related to the specific problem, resulting in a variation in their performance, depending on the different data sets and the corresponding standards used for labeling. In this paper, we propose to extend word embeddings with word vectors that indicate the semantic similarity of each word with each phishing campaigns template tag. These embedded keywords are calculated based on semantic subfields corresponding to each phishing campaign tag, constructed based on the automatic extraction of keywords representing these tags. Combining general word integrations with vectors is calculated based on word similarity using a set of sequential Kalman filters, which can then power any neural architecture such as LSTM or CNN to predict each phishing campaign. Our experiments use a data indicator to evaluate our approach and achieve remarkable results that reinforce the state-of-the-art.http://dx.doi.org/10.1155/2022/5036026
spellingShingle	Yonghui Tang Fei Wu A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods Applied Bionics and Biomechanics
title	A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods
title_full	A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods
title_fullStr	A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods
title_full_unstemmed	A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods
title_short	A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods
title_sort	deep learning filter that blocks phishing campaigns using intelligent english text recognition methods
url	http://dx.doi.org/10.1155/2022/5036026
work_keys_str_mv	AT yonghuitang adeeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods AT feiwu adeeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods AT yonghuitang deeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods AT feiwu deeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods

A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods

Similar Items