Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model

In order to improve the accuracy and processing speed of object detection in weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and improved YOLOv5 is proposed. For the case where only image-level annotations are available,...

Full description

Saved in:
Bibliographic Details
Main Authors: Yue Ma, Zhuangzhi Zhi
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:Advances in Multimedia
Online Access:http://dx.doi.org/10.1155/2022/1239337
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832567764495106048
author Yue Ma
Zhuangzhi Zhi
author_facet Yue Ma
Zhuangzhi Zhi
author_sort Yue Ma
collection DOAJ
description In order to improve the accuracy and processing speed of object detection in weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and improved YOLOv5 is proposed. For the case where only image-level annotations are available, class-specific saliency maps are generated from the backpropagation process using a VGG-16-based classification network. After obtaining the position information of the target in the image, the pseudobounding box of the target is generated, and the pseudobounding box is used as the ground-truth bounding box to optimize the real-time target detection network. An improved YOLOv5 model is proposed to transfer clear target features to deeper network layers by designing a jump connection operation, thereby solving the problem of feature ambiguity. At the same time, the convolutional attention mechanism module is introduced to solve the problem that the recognition accuracy is affected by invalid features. Experiments on the PASCAL VOC 2007+2012 datasets show that when only image-level annotations are available in the training data, the proposed method can effectively improve the processing speed and maintain a good target detection accuracy, realizing real-time object detection under weakly supervised conditions.
format Article
id doaj-art-e7bc5cfafdaf43498655d4896f0a1f79
institution Kabale University
issn 1687-5699
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series Advances in Multimedia
spelling doaj-art-e7bc5cfafdaf43498655d4896f0a1f792025-02-03T01:00:41ZengWileyAdvances in Multimedia1687-56992022-01-01202210.1155/2022/1239337Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 ModelYue Ma0Zhuangzhi Zhi1Criminal Investigation Police University of ChinaSchool of Medical InstrumentIn order to improve the accuracy and processing speed of object detection in weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and improved YOLOv5 is proposed. For the case where only image-level annotations are available, class-specific saliency maps are generated from the backpropagation process using a VGG-16-based classification network. After obtaining the position information of the target in the image, the pseudobounding box of the target is generated, and the pseudobounding box is used as the ground-truth bounding box to optimize the real-time target detection network. An improved YOLOv5 model is proposed to transfer clear target features to deeper network layers by designing a jump connection operation, thereby solving the problem of feature ambiguity. At the same time, the convolutional attention mechanism module is introduced to solve the problem that the recognition accuracy is affected by invalid features. Experiments on the PASCAL VOC 2007+2012 datasets show that when only image-level annotations are available in the training data, the proposed method can effectively improve the processing speed and maintain a good target detection accuracy, realizing real-time object detection under weakly supervised conditions.http://dx.doi.org/10.1155/2022/1239337
spellingShingle Yue Ma
Zhuangzhi Zhi
Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
Advances in Multimedia
title Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_full Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_fullStr Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_full_unstemmed Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_short Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_sort weakly supervised real time object detection based on salient map extraction and the improved yolov5 model
url http://dx.doi.org/10.1155/2022/1239337
work_keys_str_mv AT yuema weaklysupervisedrealtimeobjectdetectionbasedonsalientmapextractionandtheimprovedyolov5model
AT zhuangzhizhi weaklysupervisedrealtimeobjectdetectionbasedonsalientmapextractionandtheimprovedyolov5model