GDText-VM: an arbitrary-shaped scene text detector based on globally deformable VMamba

GDText-VM: an arbitrary-shaped scene text detector based on globally deformable VMamba

Abstract Detecting arbitrary-shaped text in natural scenes remains a significant challenge in deep learning research. Contemporary text detectors based on Convolutional Neural Networks face challenges in effectively modeling long-range dependencies. While Vision Transformers theoretically enable glo...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yingnan Zhao, Zheng Hu, Fangqi Ding, Jielin Jiang, Xiaolong Xu
Format:	Article
Language:	English
Published:	Springer 2025-06-01
Series:	Complex & Intelligent Systems
Subjects:	Computer vision Globally Deformable VMamba Attention mechanism Scene text detection
Online Access:	https://doi.org/10.1007/s40747-025-01987-6
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Cascaded Dual-Inpainting Network for Scene Text
by: Chunmei Liu
Published: (2025-07-01)

A VMamba-Based Spatial–Spectral Fusion Network for Remote Sensing Image Classification
by: Lan Luo, et al.
Published: (2025-01-01)

Turkish scene text recognition: Introducing extensive real and synthetic datasets and a novel recognition model
by: Serdar Yıldız
Published: (2024-12-01)

A text clarification and deep relational reasoning method for Mongolian-Chinese bilingual arbitrary-shaped scene text detection
by: Yuefeng Liu, et al.
Published: (2025-07-01)

VMMCD: VMamba-Based Multi-Scale Feature Guiding Fusion Network for Remote Sensing Change Detection
by: Zhong Chen, et al.
Published: (2025-05-01)

Text Font Correction and Alignment Method for Scene Text Recognition
by: Liuxu Ding, et al.
Published: (2024-12-01)

Leveraging text semantics for enhanced scene text image super-resolution
by: Li Chen, et al.
Published: (2025-06-01)

CLIP-Llama: A New Approach for Scene Text Recognition with a Pre-Trained Vision-Language Model and a Pre-Trained Language Model
by: Xiaoqing Zhao, et al.
Published: (2024-11-01)

Apvit: ViT with adaptive patches for scene text recognition
by: Ning Zhang, et al.
Published: (2025-03-01)

KSTRV1: A scene text recognition dataset for central Kurdish in (Arabic-Based) scriptZenodo
by: Sardar Omar Salih, et al.
Published: (2025-06-01)

Leveraging Text Signed Distance Function Map for Boundary-Aware Guidance in Scene Text Segmentation
by: Ho Jun Kim, et al.
Published: (2025-01-01)

MSER Fast Skewed Scene-text Location Algorithm
by: ZHANG Kai-yu, et al.
Published: (2019-04-01)

LFEN: A language feature enhanced network for scene text recognition
by: Hui Chen, et al.
Published: (2025-01-01)

Single-Character-Based Embedding Feature Aggregation Using Cross-Attention for Scene Text Super-Resolution
by: Meng Wang, et al.
Published: (2025-04-01)

Better Skeleton Better Readability: Scene Text Image Super-Resolution via Skeleton-Aware Diffusion Model
by: Shrey Singh, et al.
Published: (2024-01-01)

Toward AI-Enabled Approach for Urdu Text Recognition: A Legacy for Urdu Image Apprehension
by: Kamlesh Narwani, et al.
Published: (2025-01-01)

Scene Text Recognition That Eliminates Background and Character Noise Interference
by: Shancheng Tang, et al.
Published: (2025-03-01)

Text-Guided Diverse Scene Interaction Synthesis by Disentangling Actions From Scenes
by: Hitoshi Teshima, et al.
Published: (2025-01-01)

Deep Learning Small Water Body Mapping by Transfer Learning from Sentinel-2 to PlanetScope
by: Yuyang Li, et al.
Published: (2025-08-01)

DADNet: text detection of arbitrary shapes from drone perspective based on boundary adaptation
by: Jun Liu, et al.
Published: (2024-11-01)

Rough-and-Refine Model for Scene Graph Generation
by: Li Junliang, et al.
Published: (2025-01-01)

MAPE-ViT: multimodal scene understanding with novel wavelet-augmented Vision Transformer
by: Muhammad Waqas Ahmed, et al.
Published: (2025-05-01)

Streaming LiDAR Scene Flow Estimation
by: Mazen Abdelfattah, et al.
Published: (2025-01-01)

Three-Dimensional Real-Scene-Enhanced GNSS/Intelligent Vision Surface Deformation Monitoring System
by: Yuanrong He, et al.
Published: (2025-04-01)

Text Detection Method With Emphasis on Text Component Importance and Lightweight Design
by: Lanlan Yin, et al.
Published: (2024-01-01)

Hybrid pre trained model based feature extraction for enhanced indoor scene classification in federated learning environments
by: Monica Dutta, et al.
Published: (2025-08-01)

BiFormer for Scene Graph Generation Based on VisionNet With Taylor Hiking Optimization Algorithm
by: S. Monesh, et al.
Published: (2025-01-01)

SPIN-SGG: spatial integration for open-vocabulary scene graph generation
by: Nanhao Liang, et al.
Published: (2025-08-01)

Are vision transformers replacing convolutional neural networks in scene interpretation?: A review
by: N. Arockia Rosy, et al.
Published: (2025-08-01)

Vision-Degree-Driven Loading Strategy for Real-Time Large-Scale Scene Rendering
by: Yu Ding, et al.
Published: (2025-07-01)

A Dynamic Interference Detection Method of Underwater Scenes Based on Deep Learning and Attention Mechanism
by: Shuo Shang, et al.
Published: (2024-11-01)

Improved YOLOv8s-based foreign object detection method for mine conveyor belts
by: LI Runze, et al.
Published: (2025-06-01)

Dense Segmentation Techniques Using Deep Learning for Urban Scene Parsing: A Review
by: Rajesh Ankareddy, et al.
Published: (2025-01-01)

MYSTERY AND THE POSTMODERN SCENE: PYNCHONEAN VIEW
by: Amira Halim
Published: (2016-06-01)

Semantic-enhanced panoptic scene graph generation through hybrid and axial attentions
by: Xinhe Kuang, et al.
Published: (2024-12-01)

Nav2Scene: Navigation-driven fine-tuning for robot-friendly scene generation
by: Bowei Jiang, et al.
Published: (2025-09-01)

GLFFNet: Global–Local Feature Fusion Network for High-Resolution Remote Sensing Image Semantic Segmentation
by: Saifeng Zhu, et al.
Published: (2025-03-01)

MuRelSGG: Multimodal Relationship Prediction for Neurosymbolic Scene Graph Generation
by: Muhammad Junaid Khan, et al.
Published: (2025-01-01)

On the 'Where' and 'When' of Eye Guidance in Real-World Scenes
by: Antje Nuthmann
Published: (2019-11-01)

End-to-end scene text detection and recognition algorithm based on Transformer decoders
by: Jinzhi ZHENG, et al.
Published: (2023-05-01)