Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model

In order to improve the accuracy and processing speed of object detection in weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and improved YOLOv5 is proposed. For the case where only image-level annotations are available,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yue Ma, Zhuangzhi Zhi
Format:	Article
Language:	English
Published:	Wiley 2022-01-01
Series:	Advances in Multimedia
Online Access:	http://dx.doi.org/10.1155/2022/1239337
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832567764495106048
author	Yue Ma Zhuangzhi Zhi
author_facet	Yue Ma Zhuangzhi Zhi
author_sort	Yue Ma
collection	DOAJ
description	In order to improve the accuracy and processing speed of object detection in weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and improved YOLOv5 is proposed. For the case where only image-level annotations are available, class-specific saliency maps are generated from the backpropagation process using a VGG-16-based classification network. After obtaining the position information of the target in the image, the pseudobounding box of the target is generated, and the pseudobounding box is used as the ground-truth bounding box to optimize the real-time target detection network. An improved YOLOv5 model is proposed to transfer clear target features to deeper network layers by designing a jump connection operation, thereby solving the problem of feature ambiguity. At the same time, the convolutional attention mechanism module is introduced to solve the problem that the recognition accuracy is affected by invalid features. Experiments on the PASCAL VOC 2007+2012 datasets show that when only image-level annotations are available in the training data, the proposed method can effectively improve the processing speed and maintain a good target detection accuracy, realizing real-time object detection under weakly supervised conditions.
format	Article
id	doaj-art-e7bc5cfafdaf43498655d4896f0a1f79
institution	Kabale University
issn	1687-5699
language	English
publishDate	2022-01-01
publisher	Wiley
record_format	Article
series	Advances in Multimedia
spelling	doaj-art-e7bc5cfafdaf43498655d4896f0a1f792025-02-03T01:00:41ZengWileyAdvances in Multimedia1687-56992022-01-01202210.1155/2022/1239337Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 ModelYue Ma0Zhuangzhi Zhi1Criminal Investigation Police University of ChinaSchool of Medical InstrumentIn order to improve the accuracy and processing speed of object detection in weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and improved YOLOv5 is proposed. For the case where only image-level annotations are available, class-specific saliency maps are generated from the backpropagation process using a VGG-16-based classification network. After obtaining the position information of the target in the image, the pseudobounding box of the target is generated, and the pseudobounding box is used as the ground-truth bounding box to optimize the real-time target detection network. An improved YOLOv5 model is proposed to transfer clear target features to deeper network layers by designing a jump connection operation, thereby solving the problem of feature ambiguity. At the same time, the convolutional attention mechanism module is introduced to solve the problem that the recognition accuracy is affected by invalid features. Experiments on the PASCAL VOC 2007+2012 datasets show that when only image-level annotations are available in the training data, the proposed method can effectively improve the processing speed and maintain a good target detection accuracy, realizing real-time object detection under weakly supervised conditions.http://dx.doi.org/10.1155/2022/1239337
spellingShingle	Yue Ma Zhuangzhi Zhi Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model Advances in Multimedia
title	Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_full	Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_fullStr	Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_full_unstemmed	Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_short	Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
title_sort	weakly supervised real time object detection based on salient map extraction and the improved yolov5 model
url	http://dx.doi.org/10.1155/2022/1239337
work_keys_str_mv	AT yuema weaklysupervisedrealtimeobjectdetectionbasedonsalientmapextractionandtheimprovedyolov5model AT zhuangzhizhi weaklysupervisedrealtimeobjectdetectionbasedonsalientmapextractionandtheimprovedyolov5model

Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model

Similar Items