Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection
Abstract Vision Transformer-based detectors have achieved remarkable success in the field of object detection, but the application of these models to high-resolution remote sensing imagery faces challenges in computational costs and performance bottlenecks due to the increased computational complexi...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-05-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-99835-7 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850042699190632448 |
|---|---|
| author | Lihua Yang Yi Gu Hao Feng |
| author_facet | Lihua Yang Yi Gu Hao Feng |
| author_sort | Lihua Yang |
| collection | DOAJ |
| description | Abstract Vision Transformer-based detectors have achieved remarkable success in the field of object detection, but the application of these models to high-resolution remote sensing imagery faces challenges in computational costs and performance bottlenecks due to the increased computational complexity required to process high-resolution imagery, especially when capturing fine-grained edge features. Therefore, there is significant potential for performance optimization. To address these challenges, we propose an improved EMF-DETR based on RT-DERT-ResNet-18. EMF-DETR introduces a multi-scale edge-aware feature extraction network named MEFE-Net. The network improves object recognition and localization capabilities by extracting multi-scale features and enhancing edge information for targets at each scale, demonstrating exceptional performance in small object detection. To further enhance feature representation, the model introduces the CSFCN method, which adaptively adjusts contextual information and precisely calibrates spatial features, ensuring accurate alignment and optimization of features across different scales. In evaluations on the VisDrone2019 dataset, the proposed method achieved a 2.0% improvement in mAP compared to the baseline model, with increases of 1.5% and 2.6% in small (APS) and medium (APM) object detection respectively. Meanwhile, the number of parameters was reduced by 20.22%, demonstrating not only improved detection accuracy but also lower computational cost, highlighting its practical application potential in remote sensing image analysis. |
| format | Article |
| id | doaj-art-f59d3969b490416cad0e55bb5c0a0f15 |
| institution | DOAJ |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-05-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-f59d3969b490416cad0e55bb5c0a0f152025-08-20T02:55:28ZengNature PortfolioScientific Reports2045-23222025-05-0115112210.1038/s41598-025-99835-7Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detectionLihua Yang0Yi Gu1Hao Feng2School of Mechanical and Electronic Engineering, Jingdezhen Ceramic UniversitySchool of Mechanical and Electronic Engineering, Jingdezhen Ceramic UniversitySchool of Mechanical and Electronic Engineering, Jingdezhen Ceramic UniversityAbstract Vision Transformer-based detectors have achieved remarkable success in the field of object detection, but the application of these models to high-resolution remote sensing imagery faces challenges in computational costs and performance bottlenecks due to the increased computational complexity required to process high-resolution imagery, especially when capturing fine-grained edge features. Therefore, there is significant potential for performance optimization. To address these challenges, we propose an improved EMF-DETR based on RT-DERT-ResNet-18. EMF-DETR introduces a multi-scale edge-aware feature extraction network named MEFE-Net. The network improves object recognition and localization capabilities by extracting multi-scale features and enhancing edge information for targets at each scale, demonstrating exceptional performance in small object detection. To further enhance feature representation, the model introduces the CSFCN method, which adaptively adjusts contextual information and precisely calibrates spatial features, ensuring accurate alignment and optimization of features across different scales. In evaluations on the VisDrone2019 dataset, the proposed method achieved a 2.0% improvement in mAP compared to the baseline model, with increases of 1.5% and 2.6% in small (APS) and medium (APM) object detection respectively. Meanwhile, the number of parameters was reduced by 20.22%, demonstrating not only improved detection accuracy but also lower computational cost, highlighting its practical application potential in remote sensing image analysis.https://doi.org/10.1038/s41598-025-99835-7Object detectionRemote sensing imageEdge feature enhancementFeature calibration |
| spellingShingle | Lihua Yang Yi Gu Hao Feng Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection Scientific Reports Object detection Remote sensing image Edge feature enhancement Feature calibration |
| title | Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection |
| title_full | Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection |
| title_fullStr | Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection |
| title_full_unstemmed | Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection |
| title_short | Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection |
| title_sort | multi scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection |
| topic | Object detection Remote sensing image Edge feature enhancement Feature calibration |
| url | https://doi.org/10.1038/s41598-025-99835-7 |
| work_keys_str_mv | AT lihuayang multiscalefeaturefusionandfeaturecalibrationwithedgeinformationenhancementforremotesensingobjectdetection AT yigu multiscalefeaturefusionandfeaturecalibrationwithedgeinformationenhancementforremotesensingobjectdetection AT haofeng multiscalefeaturefusionandfeaturecalibrationwithedgeinformationenhancementforremotesensingobjectdetection |