Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model
To improve the accuracy and processing speed of object detection in a weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and an improved YOLOv5 is proposed. For the case where only image-level annotations are available,...
Main Authors: | Yue Ma, Zhuangzhi Zhi |
---|---|
Format: | Article |
Language: | English |
Published: | Wiley, 2022-01-01 |
Series: | Advances in Multimedia |
Online Access: | http://dx.doi.org/10.1155/2022/1239337 |
_version_ | 1832567764495106048 |
---|---|
author | Yue Ma Zhuangzhi Zhi |
author_facet | Yue Ma Zhuangzhi Zhi |
author_sort | Yue Ma |
collection | DOAJ |
description | To improve the accuracy and processing speed of object detection in a weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and an improved YOLOv5 is proposed. For the case where only image-level annotations are available, class-specific saliency maps are generated through backpropagation in a VGG-16-based classification network. Once the position of the target in the image has been obtained, a pseudo-bounding box is generated for the target and used as the ground-truth bounding box to optimize the real-time object detection network. The improved YOLOv5 model uses a skip (jump) connection to pass clear target features to deeper network layers, addressing the problem of feature ambiguity. In addition, a convolutional attention mechanism module is introduced to reduce the effect of invalid features on recognition accuracy. Experiments on the PASCAL VOC 2007+2012 datasets show that, when only image-level annotations are available in the training data, the proposed method improves processing speed while maintaining good detection accuracy, achieving real-time object detection under weakly supervised conditions. |
format | Article |
id | doaj-art-e7bc5cfafdaf43498655d4896f0a1f79 |
institution | Kabale University |
issn | 1687-5699 |
language | English |
publishDate | 2022-01-01 |
publisher | Wiley |
record_format | Article |
series | Advances in Multimedia |
spelling | doaj-art-e7bc5cfafdaf43498655d4896f0a1f792025-02-03T01:00:41ZengWileyAdvances in Multimedia1687-56992022-01-01202210.1155/2022/1239337Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 ModelYue Ma0Zhuangzhi Zhi1Criminal Investigation Police University of ChinaSchool of Medical InstrumentIn order to improve the accuracy and processing speed of object detection in weakly supervised learning environment, a weakly supervised real-time object detection method based on saliency map extraction and improved YOLOv5 is proposed. For the case where only image-level annotations are available, class-specific saliency maps are generated from the backpropagation process using a VGG-16-based classification network. After obtaining the position information of the target in the image, the pseudobounding box of the target is generated, and the pseudobounding box is used as the ground-truth bounding box to optimize the real-time target detection network. An improved YOLOv5 model is proposed to transfer clear target features to deeper network layers by designing a jump connection operation, thereby solving the problem of feature ambiguity. At the same time, the convolutional attention mechanism module is introduced to solve the problem that the recognition accuracy is affected by invalid features. Experiments on the PASCAL VOC 2007+2012 datasets show that when only image-level annotations are available in the training data, the proposed method can effectively improve the processing speed and maintain a good target detection accuracy, realizing real-time object detection under weakly supervised conditions.http://dx.doi.org/10.1155/2022/1239337 |
spellingShingle | Yue Ma Zhuangzhi Zhi Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model Advances in Multimedia |
title | Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model |
title_full | Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model |
title_fullStr | Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model |
title_full_unstemmed | Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model |
title_short | Weakly Supervised Real-Time Object Detection Based on Salient Map Extraction and the Improved YOLOv5 Model |
title_sort | weakly supervised real time object detection based on salient map extraction and the improved yolov5 model |
url | http://dx.doi.org/10.1155/2022/1239337 |
work_keys_str_mv | AT yuema weaklysupervisedrealtimeobjectdetectionbasedonsalientmapextractionandtheimprovedyolov5model AT zhuangzhizhi weaklysupervisedrealtimeobjectdetectionbasedonsalientmapextractionandtheimprovedyolov5model |
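
The description above says that a pseudo-bounding box is generated once the saliency map has located the target, but this record does not state how the box is derived. The following is a minimal sketch of one common approach, not the authors' procedure: it assumes the saliency map is a 2D array, thresholds it at a fraction of its peak value, and takes the tightest box around the surviving pixels. The function name `pseudo_box_from_saliency` and the parameter `rel_threshold` are hypothetical and chosen only for illustration.

```python
import numpy as np


def pseudo_box_from_saliency(saliency, rel_threshold=0.5):
    """Return (x_min, y_min, x_max, y_max) around salient pixels, or None.

    Assumption (not from the record): pixels whose saliency is at least
    rel_threshold * max(saliency) are treated as belonging to the target.
    """
    saliency = np.asarray(saliency, dtype=float)
    peak = saliency.max()
    if peak <= 0:
        # No positive response anywhere, so there is nothing to box.
        return None
    mask = saliency >= rel_threshold * peak
    ys, xs = np.nonzero(mask)  # row and column indices of salient pixels
    return int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max())


if __name__ == "__main__":
    # Toy 6x8 saliency map with one bright rectangular region.
    demo = np.zeros((6, 8))
    demo[2:5, 3:7] = 1.0
    demo[0, 0] = 0.1  # weak background response, below the threshold
    print(pseudo_box_from_saliency(demo))
```

A box produced this way could then stand in for a ground-truth annotation when training a detector. In practice the threshold would need tuning per dataset, and images containing several objects of the same class would need connected-component handling that this sketch omits.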