An Investigation of Infrared Small Target Detection by Using the SPT–YOLO Technique

To detect and recognize small-size and submerged complex background targets in infrared images, we combine a dynamic receptive field fusion strategy and a multi-scale feature fusion mechanism to improve the detection performance of small targets significantly. The space-to-depth convolution module i...

Full description

Saved in:
Bibliographic Details
Main Authors: Yongjun Qi, Shaohua Yang, Zhengzheng Jia, Yuanmeng Song, Jie Zhu, Xin Liu, Hongxing Zheng
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Technologies
Subjects:
Online Access:https://www.mdpi.com/2227-7080/13/1/40
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832587454481170432
author Yongjun Qi
Shaohua Yang
Zhengzheng Jia
Yuanmeng Song
Jie Zhu
Xin Liu
Hongxing Zheng
author_facet Yongjun Qi
Shaohua Yang
Zhengzheng Jia
Yuanmeng Song
Jie Zhu
Xin Liu
Hongxing Zheng
author_sort Yongjun Qi
collection DOAJ
description To detect and recognize small-size and submerged complex background targets in infrared images, we combine a dynamic receptive field fusion strategy and a multi-scale feature fusion mechanism to improve the detection performance of small targets significantly. The space-to-depth convolution module is introduced as a downsampling layer in the backbone first and achieves the same sampling effect. More detailed information is retained at the same time. Thus, the model’s detection capability for small targets has been enhanced. Then, the pyramid level 2 feature map with minimum receptive field and maximum resolution is added to the neck, which reduces the loss of positional information during feature sampling. Furthermore, x-small detection heads are added, the understanding of the overall characteristics and structure of the target is enhanced much more, and the representation and localization of small targets have been improved. Finally, the cross-entropy loss function in the original network model is replaced by an adaptive threshold focal loss function, forcing the model to allocate more attention to target features. The above methods are based on a public tool, the eighth version of You Only Look Once (YOLO) improved, it is named SPT–YOLO (SPDConv + P2 + Adaptive Threshold + YOLOV8s) in this paper. Some experiments on datasets such as infrared small object detection (IR-SOD) and infrared small target detection 1K(IRSTD-1K), etc. have been executed to verify the proposed algorithm; and the mean average precision of 94.0% and 69% under the condition of threshold at 0.5 and over a range from 0.5 to 0.95 is obtained, respectively. The results show that the proposed method achieves the best performance of infrared small target detection compared to existing methods.
format Article
id doaj-art-c4ebbab0fda045d28aaff73e5bec6ec8
institution Kabale University
issn 2227-7080
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Technologies
spelling doaj-art-c4ebbab0fda045d28aaff73e5bec6ec82025-01-24T13:50:50ZengMDPI AGTechnologies2227-70802025-01-011314010.3390/technologies13010040An Investigation of Infrared Small Target Detection by Using the SPT–YOLO TechniqueYongjun Qi0Shaohua Yang1Zhengzheng Jia2Yuanmeng Song3Jie Zhu4Xin Liu5Hongxing Zheng6School of Computer Science and Engineering of North China Institute of Aerospace Engineering, Langfang 065000, ChinaSchool of Aeronautics and Astronautics of North China Institute of Aerospace Engineering, Langfang 065000, ChinaSchool of Computer Science and Engineering of North China Institute of Aerospace Engineering, Langfang 065000, ChinaSchool of Computer Science and Engineering of North China Institute of Aerospace Engineering, Langfang 065000, ChinaSchool of Computer Science and Engineering of North China Institute of Aerospace Engineering, Langfang 065000, ChinaSchool of Optoelectronic Engineering, Xidian University, Xi’an 710071, ChinaSchool of Computer Science and Engineering of North China Institute of Aerospace Engineering, Langfang 065000, ChinaTo detect and recognize small-size and submerged complex background targets in infrared images, we combine a dynamic receptive field fusion strategy and a multi-scale feature fusion mechanism to improve the detection performance of small targets significantly. The space-to-depth convolution module is introduced as a downsampling layer in the backbone first and achieves the same sampling effect. More detailed information is retained at the same time. Thus, the model’s detection capability for small targets has been enhanced. Then, the pyramid level 2 feature map with minimum receptive field and maximum resolution is added to the neck, which reduces the loss of positional information during feature sampling. Furthermore, x-small detection heads are added, the understanding of the overall characteristics and structure of the target is enhanced much more, and the representation and localization of small targets have been improved. Finally, the cross-entropy loss function in the original network model is replaced by an adaptive threshold focal loss function, forcing the model to allocate more attention to target features. The above methods are based on a public tool, the eighth version of You Only Look Once (YOLO) improved, it is named SPT–YOLO (SPDConv + P2 + Adaptive Threshold + YOLOV8s) in this paper. Some experiments on datasets such as infrared small object detection (IR-SOD) and infrared small target detection 1K(IRSTD-1K), etc. have been executed to verify the proposed algorithm; and the mean average precision of 94.0% and 69% under the condition of threshold at 0.5 and over a range from 0.5 to 0.95 is obtained, respectively. The results show that the proposed method achieves the best performance of infrared small target detection compared to existing methods.https://www.mdpi.com/2227-7080/13/1/40infrared small target detectiondynamic receptive field fusionmulti-scale feature fusionspace-to-depth convolutionYOLO
spellingShingle Yongjun Qi
Shaohua Yang
Zhengzheng Jia
Yuanmeng Song
Jie Zhu
Xin Liu
Hongxing Zheng
An Investigation of Infrared Small Target Detection by Using the SPT–YOLO Technique
Technologies
infrared small target detection
dynamic receptive field fusion
multi-scale feature fusion
space-to-depth convolution
YOLO
title An Investigation of Infrared Small Target Detection by Using the SPT–YOLO Technique
title_full An Investigation of Infrared Small Target Detection by Using the SPT–YOLO Technique
title_fullStr An Investigation of Infrared Small Target Detection by Using the SPT–YOLO Technique
title_full_unstemmed An Investigation of Infrared Small Target Detection by Using the SPT–YOLO Technique
title_short An Investigation of Infrared Small Target Detection by Using the SPT–YOLO Technique
title_sort investigation of infrared small target detection by using the spt yolo technique
topic infrared small target detection
dynamic receptive field fusion
multi-scale feature fusion
space-to-depth convolution
YOLO
url https://www.mdpi.com/2227-7080/13/1/40
work_keys_str_mv AT yongjunqi aninvestigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT shaohuayang aninvestigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT zhengzhengjia aninvestigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT yuanmengsong aninvestigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT jiezhu aninvestigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT xinliu aninvestigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT hongxingzheng aninvestigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT yongjunqi investigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT shaohuayang investigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT zhengzhengjia investigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT yuanmengsong investigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT jiezhu investigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT xinliu investigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique
AT hongxingzheng investigationofinfraredsmalltargetdetectionbyusingthesptyolotechnique