Triple-attentions based salient object detector for strip steel surface defects

Abstract Accurate detection of surface defects on strip steel is essential for ensuring strip steel product quality. Existing deep learning based detectors for strip steel surface defects typically strive to iteratively refine and integrate the coarse outputs of the backbone network, enhancing the m...

Full description

Saved in:
Bibliographic Details
Main Authors: Li Zhang, Xirui Li, Yange Sun, Huaping Guo
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-86353-9
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832585713292410880
author Li Zhang
Xirui Li
Yange Sun
Huaping Guo
author_facet Li Zhang
Xirui Li
Yange Sun
Huaping Guo
author_sort Li Zhang
collection DOAJ
description Abstract Accurate detection of surface defects on strip steel is essential for ensuring strip steel product quality. Existing deep learning based detectors for strip steel surface defects typically strive to iteratively refine and integrate the coarse outputs of the backbone network, enhancing the models’ ability to express defect characteristics. Attention mechanisms including spatial attention, channel attention and self-attention are among the most prevalent techniques for feature extraction and fusion. This paper introduces an innovative triple-attention mechanism (TA), characterized by interrelated and complementary interactions, that concurrently and iteratively refines and integrates feature maps from three distinct perspectives, thereby enhancing the features’ capacity for representation. The idea is from the following observation: given a three-dimensional feature map, we can examine the feature map from the three different yet interrelated two-dimensional planar perspectives: the channel-width, channel-height, and width-height perspectives. Based on the TA, a novel detector, called TADet, for the detection of steel strip surface defects is proposed, which is an encoder-decoder network: the decoder uses the proposed TA refines/fuses the multiscale rough features generated by the encoder (backbone network) from the three distinct perspectives (branches) and then integrates the purified feature maps from the three branches. Extensive experimental results show that TADet is superior to the state-of-the-art methods in terms of mean absolute error, S-measure, E-measure and F-measure, confirming the effectiveness and robustness of the proposed TADet. Our code and experimental results are available at https://github.com/hpguo1982/TADet .
format Article
id doaj-art-6a37c4675f4f46bb99cddfa624b68efb
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-6a37c4675f4f46bb99cddfa624b68efb2025-01-26T12:32:41ZengNature PortfolioScientific Reports2045-23222025-01-0115111310.1038/s41598-025-86353-9Triple-attentions based salient object detector for strip steel surface defectsLi Zhang0Xirui Li1Yange Sun2Huaping Guo3School of Computer and Information Technology, Xinyang Normal UniversitySchool of Computer and Information Technology, Xinyang Normal UniversitySchool of Computer and Information Technology, Xinyang Normal UniversitySchool of Computer and Information Technology, Xinyang Normal UniversityAbstract Accurate detection of surface defects on strip steel is essential for ensuring strip steel product quality. Existing deep learning based detectors for strip steel surface defects typically strive to iteratively refine and integrate the coarse outputs of the backbone network, enhancing the models’ ability to express defect characteristics. Attention mechanisms including spatial attention, channel attention and self-attention are among the most prevalent techniques for feature extraction and fusion. This paper introduces an innovative triple-attention mechanism (TA), characterized by interrelated and complementary interactions, that concurrently and iteratively refines and integrates feature maps from three distinct perspectives, thereby enhancing the features’ capacity for representation. The idea is from the following observation: given a three-dimensional feature map, we can examine the feature map from the three different yet interrelated two-dimensional planar perspectives: the channel-width, channel-height, and width-height perspectives. Based on the TA, a novel detector, called TADet, for the detection of steel strip surface defects is proposed, which is an encoder-decoder network: the decoder uses the proposed TA refines/fuses the multiscale rough features generated by the encoder (backbone network) from the three distinct perspectives (branches) and then integrates the purified feature maps from the three branches. Extensive experimental results show that TADet is superior to the state-of-the-art methods in terms of mean absolute error, S-measure, E-measure and F-measure, confirming the effectiveness and robustness of the proposed TADet. Our code and experimental results are available at https://github.com/hpguo1982/TADet .https://doi.org/10.1038/s41598-025-86353-9Triple attentionSpatial attentionDefect detectionFeature fusion
spellingShingle Li Zhang
Xirui Li
Yange Sun
Huaping Guo
Triple-attentions based salient object detector for strip steel surface defects
Scientific Reports
Triple attention
Spatial attention
Defect detection
Feature fusion
title Triple-attentions based salient object detector for strip steel surface defects
title_full Triple-attentions based salient object detector for strip steel surface defects
title_fullStr Triple-attentions based salient object detector for strip steel surface defects
title_full_unstemmed Triple-attentions based salient object detector for strip steel surface defects
title_short Triple-attentions based salient object detector for strip steel surface defects
title_sort triple attentions based salient object detector for strip steel surface defects
topic Triple attention
Spatial attention
Defect detection
Feature fusion
url https://doi.org/10.1038/s41598-025-86353-9
work_keys_str_mv AT lizhang tripleattentionsbasedsalientobjectdetectorforstripsteelsurfacedefects
AT xiruili tripleattentionsbasedsalientobjectdetectorforstripsteelsurfacedefects
AT yangesun tripleattentionsbasedsalientobjectdetectorforstripsteelsurfacedefects
AT huapingguo tripleattentionsbasedsalientobjectdetectorforstripsteelsurfacedefects