DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation

Abstract The rapid development of drone technology has made drones one of the essential tools for acquiring aerial information. The detection and localization of text information through drones greatly enhance their understanding of the environment, enabling tasks of significant importance such as c...

Full description

Saved in:
Bibliographic Details
Main Authors: Jun Liu, Jianxun Zhang, Ting Tang, Shengyuan Wu
Format: Article
Language:English
Published: Springer 2024-11-01
Series:Complex & Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1007/s40747-024-01617-7
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832571160746786816
author Jun Liu
Jianxun Zhang
Ting Tang
Shengyuan Wu
author_facet Jun Liu
Jianxun Zhang
Ting Tang
Shengyuan Wu
author_sort Jun Liu
collection DOAJ
description Abstract The rapid development of drone technology has made drones one of the essential tools for acquiring aerial information. The detection and localization of text information through drones greatly enhance their understanding of the environment, enabling tasks of significant importance such as community commercial planning and autonomous navigation in intelligent environments. However, the unique perspective and complex environment during drone photography lead to various challenges in text detection, including diverse text shapes, large-scale variations, and background interference, making traditional methods inadequate. To address this issue, we propose a drone-based text detection method based on boundary adaptation. We first conduct an in-depth analysis of text characteristics from a drone’s perspective. Using ResNet50 as the backbone network, we introduce the proposed Hybrid Text Attention Mechanism into the backbone network to enhance the perception of text regions in the feature extraction module. Additionally, we propose a Spatial Feature Fusion Module to adaptively fuse text features of different scales, thereby enhancing the model’s adaptability. Furthermore, we introduce a text detail transformer by incorporating a local feature extractor into the transformer of the text detail boundary iteration optimization module. This enables the precise optimization and localization of text boundaries by reducing the interference of complex backgrounds, eliminating the need for complex post-processing. Extensive experiments on challenging text detection datasets and drone-based text detection datasets validate the high robustness and state-of-the-art performance of our proposed method, laying a solid foundation for practical applications.
format Article
id doaj-art-44347a215a06445bb0de36c73ba9f1a1
institution Kabale University
issn 2199-4536
2198-6053
language English
publishDate 2024-11-01
publisher Springer
record_format Article
series Complex & Intelligent Systems
spelling doaj-art-44347a215a06445bb0de36c73ba9f1a12025-02-02T12:50:04ZengSpringerComplex & Intelligent Systems2199-45362198-60532024-11-0111111410.1007/s40747-024-01617-7DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptationJun Liu0Jianxun Zhang1Ting Tang2Shengyuan Wu3Department of Computer Science and Engineering, Chongqing University of TechnologyDepartment of Computer Science and Engineering, Chongqing University of TechnologySchool of Computer and Communication Engineering, Changsha University of Science & TechnologyDepartment of Computer Science and Engineering, Chongqing University of TechnologyAbstract The rapid development of drone technology has made drones one of the essential tools for acquiring aerial information. The detection and localization of text information through drones greatly enhance their understanding of the environment, enabling tasks of significant importance such as community commercial planning and autonomous navigation in intelligent environments. However, the unique perspective and complex environment during drone photography lead to various challenges in text detection, including diverse text shapes, large-scale variations, and background interference, making traditional methods inadequate. To address this issue, we propose a drone-based text detection method based on boundary adaptation. We first conduct an in-depth analysis of text characteristics from a drone’s perspective. Using ResNet50 as the backbone network, we introduce the proposed Hybrid Text Attention Mechanism into the backbone network to enhance the perception of text regions in the feature extraction module. Additionally, we propose a Spatial Feature Fusion Module to adaptively fuse text features of different scales, thereby enhancing the model’s adaptability. Furthermore, we introduce a text detail transformer by incorporating a local feature extractor into the transformer of the text detail boundary iteration optimization module. This enables the precise optimization and localization of text boundaries by reducing the interference of complex backgrounds, eliminating the need for complex post-processing. Extensive experiments on challenging text detection datasets and drone-based text detection datasets validate the high robustness and state-of-the-art performance of our proposed method, laying a solid foundation for practical applications.https://doi.org/10.1007/s40747-024-01617-7Computer visionAttention mechanismArbitrary shape text detectionTransformerDrone
spellingShingle Jun Liu
Jianxun Zhang
Ting Tang
Shengyuan Wu
DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation
Complex & Intelligent Systems
Computer vision
Attention mechanism
Arbitrary shape text detection
Transformer
Drone
title DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation
title_full DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation
title_fullStr DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation
title_full_unstemmed DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation
title_short DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation
title_sort dadnet text detection of arbitrary shapes from drone perspective based on boundary adaptation
topic Computer vision
Attention mechanism
Arbitrary shape text detection
Transformer
Drone
url https://doi.org/10.1007/s40747-024-01617-7
work_keys_str_mv AT junliu dadnettextdetectionofarbitraryshapesfromdroneperspectivebasedonboundaryadaptation
AT jianxunzhang dadnettextdetectionofarbitraryshapesfromdroneperspectivebasedonboundaryadaptation
AT tingtang dadnettextdetectionofarbitraryshapesfromdroneperspectivebasedonboundaryadaptation
AT shengyuanwu dadnettextdetectionofarbitraryshapesfromdroneperspectivebasedonboundaryadaptation