GDText-VM: an arbitrary-shaped scene text detector based on globally deformable VMamba
Abstract Detecting arbitrary-shaped text in natural scenes remains a significant challenge in deep learning research. Contemporary text detectors based on Convolutional Neural Networks face challenges in effectively modeling long-range dependencies. While Vision Transformers theoretically enable glo...
Saved in:
| Main Authors: | Yingnan Zhao, Zheng Hu, Fangqi Ding, Jielin Jiang, Xiaolong Xu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Springer
2025-06-01
|
| Series: | Complex & Intelligent Systems |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s40747-025-01987-6 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Cascaded Dual-Inpainting Network for Scene Text
by: Chunmei Liu
Published: (2025-07-01) -
A VMamba-Based Spatial–Spectral Fusion Network for Remote Sensing Image Classification
by: Lan Luo, et al.
Published: (2025-01-01) -
Turkish scene text recognition: Introducing extensive real and synthetic datasets and a novel recognition model
by: Serdar Yıldız
Published: (2024-12-01) -
A text clarification and deep relational reasoning method for Mongolian-Chinese bilingual arbitrary-shaped scene text detection
by: Yuefeng Liu, et al.
Published: (2025-07-01) -
VMMCD: VMamba-Based Multi-Scale Feature Guiding Fusion Network for Remote Sensing Change Detection
by: Zhong Chen, et al.
Published: (2025-05-01)