FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object Detection
Object detection in remote sensing images is essential for applications like unmanned aerial vehicle (UAV)-assisted agricultural surveys and aerial traffic analysis, facing unique challenges such as low resolution, complex backgrounds, and the variability of object scales. Current detectors struggle...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-01-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/17/2/205 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832587598735867904 |
---|---|
author | Yixin Chen Weilai Jiang Yaonan Wang |
author_facet | Yixin Chen Weilai Jiang Yaonan Wang |
author_sort | Yixin Chen |
collection | DOAJ |
description | Object detection in remote sensing images is essential for applications like unmanned aerial vehicle (UAV)-assisted agricultural surveys and aerial traffic analysis, facing unique challenges such as low resolution, complex backgrounds, and the variability of object scales. Current detectors struggle with integrating spatial and semantic information effectively across scales and often omit necessary refinement modules to focus on salient features. Furthermore, a detector head that lacks a meticulous design may face limitations in fully understanding and accurately predicting based on the enriched feature representations. These deficiencies can lead to insufficient feature representation and reduced detection accuracy. To address these challenges, this paper introduces a novel deep-learning framework, FAMHE-Net, for enhancing object detection in remote sensing images. Our framework features a consolidated multi-scale feature enhancement module (CMFEM) with integrated Path Aggregation Feature Pyramid Network (PAFPN), utilizing our efficient atrous channel attention (EACA) within CMFEM for enhanced contextual and semantic information refinement. Additionally, we introduce a sparsely gated mixture of heterogeneous expert heads (MOHEH) to adaptively aggregate detector head outputs. Compared to the baseline model, FAMEH-Net demonstrates significant improvements, achieving a 0.90% increase in mean Average Precision (mAP) of the DOTA dataset and a 1.30% increase in mAP12 of HRSC2016 datasets. These results highlight the effectiveness of FAMEH-Net in object detection within complex remote sensing images. |
format | Article |
id | doaj-art-cde32949b6b04e0aae378a5397b36fe8 |
institution | Kabale University |
issn | 2072-4292 |
language | English |
publishDate | 2025-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Remote Sensing |
spelling | doaj-art-cde32949b6b04e0aae378a5397b36fe82025-01-24T13:47:43ZengMDPI AGRemote Sensing2072-42922025-01-0117220510.3390/rs17020205FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object DetectionYixin Chen0Weilai Jiang1Yaonan Wang2The College of Electrical and Information Engineering, Hunan University, Changsha 410082, ChinaThe College of Electrical and Information Engineering, Hunan University, Changsha 410082, ChinaThe College of Electrical and Information Engineering, Hunan University, Changsha 410082, ChinaObject detection in remote sensing images is essential for applications like unmanned aerial vehicle (UAV)-assisted agricultural surveys and aerial traffic analysis, facing unique challenges such as low resolution, complex backgrounds, and the variability of object scales. Current detectors struggle with integrating spatial and semantic information effectively across scales and often omit necessary refinement modules to focus on salient features. Furthermore, a detector head that lacks a meticulous design may face limitations in fully understanding and accurately predicting based on the enriched feature representations. These deficiencies can lead to insufficient feature representation and reduced detection accuracy. To address these challenges, this paper introduces a novel deep-learning framework, FAMHE-Net, for enhancing object detection in remote sensing images. Our framework features a consolidated multi-scale feature enhancement module (CMFEM) with integrated Path Aggregation Feature Pyramid Network (PAFPN), utilizing our efficient atrous channel attention (EACA) within CMFEM for enhanced contextual and semantic information refinement. Additionally, we introduce a sparsely gated mixture of heterogeneous expert heads (MOHEH) to adaptively aggregate detector head outputs. Compared to the baseline model, FAMEH-Net demonstrates significant improvements, achieving a 0.90% increase in mean Average Precision (mAP) of the DOTA dataset and a 1.30% increase in mAP12 of HRSC2016 datasets. These results highlight the effectiveness of FAMEH-Net in object detection within complex remote sensing images.https://www.mdpi.com/2072-4292/17/2/205deep learningremote sensingobject detectionmulti-scale feature enhancement |
spellingShingle | Yixin Chen Weilai Jiang Yaonan Wang FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object Detection Remote Sensing deep learning remote sensing object detection multi-scale feature enhancement |
title | FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object Detection |
title_full | FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object Detection |
title_fullStr | FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object Detection |
title_full_unstemmed | FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object Detection |
title_short | FAMHE-Net: Multi-Scale Feature Augmentation and Mixture of Heterogeneous Experts for Oriented Object Detection |
title_sort | famhe net multi scale feature augmentation and mixture of heterogeneous experts for oriented object detection |
topic | deep learning remote sensing object detection multi-scale feature enhancement |
url | https://www.mdpi.com/2072-4292/17/2/205 |
work_keys_str_mv | AT yixinchen famhenetmultiscalefeatureaugmentationandmixtureofheterogeneousexpertsfororientedobjectdetection AT weilaijiang famhenetmultiscalefeatureaugmentationandmixtureofheterogeneousexpertsfororientedobjectdetection AT yaonanwang famhenetmultiscalefeatureaugmentationandmixtureofheterogeneousexpertsfororientedobjectdetection |