A deep learning‐based attack on text CAPTCHAs by using object detection techniques

Abstract Text‐based CAPTCHAs have been widely deployed by many popular websites, and many have been attacked. However, most previous cracks were based on classification algorithms that typically rely on a series of preprocessing operations or on many training samples, thus making such attacks compli...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jiawei Nian, Ping Wang, Haichang Gao, Xiaoyan Guo
Format:	Article
Language:	English
Published:	Wiley 2022-03-01
Series:	IET Information Security
Subjects:	Web sites feature extraction text analysis character sets human computer interaction deep learning (artificial intelligence)
Online Access:	https://doi.org/10.1049/ise2.12047
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832559614224236544
author	Jiawei Nian Ping Wang Haichang Gao Xiaoyan Guo
author_facet	Jiawei Nian Ping Wang Haichang Gao Xiaoyan Guo
author_sort	Jiawei Nian
collection	DOAJ
description	Abstract Text‐based CAPTCHAs have been widely deployed by many popular websites, and many have been attacked. However, most previous cracks were based on classification algorithms that typically rely on a series of preprocessing operations or on many training samples, thus making such attacks complicated and costly. In this study, a simple, generic, fast and end‐to‐end attack based on advanced object detection technologies is introduced. The proposed attack combines a feature extraction module, a character location and recognition module and a coordinate matching module. The experiments show that the attack can break a wide range of real‐world text CAPTCHAs deployed by the 50 most popular websites on Alexa.com and that the method achieves a high attack accuracy with only 2000 samples at an attack speed of less than 0.10 s. The attack was also evaluated on four click‐based CAPTCHAs that cannot be attacked in the end‐to‐end manner used by previous attacks, and the results demonstrated that within one step, the proposed approach achieves high success rates on both click‐based CAPTCHAs and schemes based on large‐scale character sets, such as Chinese character sets.
format	Article
id	doaj-art-345d6ea410b64ae9aa2aed85daa5463a
institution	Kabale University
issn	1751-8709 1751-8717
language	English
publishDate	2022-03-01
publisher	Wiley
record_format	Article
series	IET Information Security
spelling	doaj-art-345d6ea410b64ae9aa2aed85daa5463a2025-02-03T01:29:41ZengWileyIET Information Security1751-87091751-87172022-03-011629711010.1049/ise2.12047A deep learning‐based attack on text CAPTCHAs by using object detection techniquesJiawei Nian0Ping Wang1Haichang Gao2Xiaoyan Guo3School of Computer Science and Technology Xidian University Xi'an Shaanxi ChinaSchool of Computer Science and Technology Xidian University Xi'an Shaanxi ChinaSchool of Computer Science and Technology Xidian University Xi'an Shaanxi ChinaSchool of Computer Science and Technology Xidian University Xi'an Shaanxi ChinaAbstract Text‐based CAPTCHAs have been widely deployed by many popular websites, and many have been attacked. However, most previous cracks were based on classification algorithms that typically rely on a series of preprocessing operations or on many training samples, thus making such attacks complicated and costly. In this study, a simple, generic, fast and end‐to‐end attack based on advanced object detection technologies is introduced. The proposed attack combines a feature extraction module, a character location and recognition module and a coordinate matching module. The experiments show that the attack can break a wide range of real‐world text CAPTCHAs deployed by the 50 most popular websites on Alexa.com and that the method achieves a high attack accuracy with only 2000 samples at an attack speed of less than 0.10 s. The attack was also evaluated on four click‐based CAPTCHAs that cannot be attacked in the end‐to‐end manner used by previous attacks, and the results demonstrated that within one step, the proposed approach achieves high success rates on both click‐based CAPTCHAs and schemes based on large‐scale character sets, such as Chinese character sets.https://doi.org/10.1049/ise2.12047Web sitesfeature extractiontext analysischaracter setshuman computer interactiondeep learning (artificial intelligence)
spellingShingle	Jiawei Nian Ping Wang Haichang Gao Xiaoyan Guo A deep learning‐based attack on text CAPTCHAs by using object detection techniques IET Information Security Web sites feature extraction text analysis character sets human computer interaction deep learning (artificial intelligence)
title	A deep learning‐based attack on text CAPTCHAs by using object detection techniques
title_full	A deep learning‐based attack on text CAPTCHAs by using object detection techniques
title_fullStr	A deep learning‐based attack on text CAPTCHAs by using object detection techniques
title_full_unstemmed	A deep learning‐based attack on text CAPTCHAs by using object detection techniques
title_short	A deep learning‐based attack on text CAPTCHAs by using object detection techniques
title_sort	deep learning based attack on text captchas by using object detection techniques
topic	Web sites feature extraction text analysis character sets human computer interaction deep learning (artificial intelligence)
url	https://doi.org/10.1049/ise2.12047
work_keys_str_mv	AT jiaweinian adeeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques AT pingwang adeeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques AT haichanggao adeeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques AT xiaoyanguo adeeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques AT jiaweinian deeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques AT pingwang deeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques AT haichanggao deeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques AT xiaoyanguo deeplearningbasedattackontextcaptchasbyusingobjectdetectiontechniques

A deep learning‐based attack on text CAPTCHAs by using object detection techniques

Similar Items