Fast Visual Tracking with Enhanced and Gradient-Guide Network

The existing Siamese trackers express visual tracking through the cross-correlation operation between two neural networks. Although they dominated the tracking field, their adopted pattern caused two main problems. One is the adoption of the deep architecture that drives the Siamese tracker to sacri...

Full description

Saved in:
Bibliographic Details
Main Authors: Dun Cao, Renhua Dai
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:Advances in Multimedia
Online Access:http://dx.doi.org/10.1155/2024/9944425
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832559445514649600
author Dun Cao
Renhua Dai
author_facet Dun Cao
Renhua Dai
author_sort Dun Cao
collection DOAJ
description The existing Siamese trackers express visual tracking through the cross-correlation operation between two neural networks. Although they dominated the tracking field, their adopted pattern caused two main problems. One is the adoption of the deep architecture that drives the Siamese tracker to sacrifice speed for performance, and the other is that the template is fixed to the initial features; namely, the template cannot be updated timely, making performance entirely dependent on the Siamese network’s matching ability. In this work, we propose a tracker called SiamMLG. Firstly, we adopt the lightweight ResNet-34 as the backbone to improve the proposed tracker’s speed by reducing the computational complexity, and then, to compensate for the performance loss caused by the lightweight backbone, we embed the SKNet from the attention mechanism to filter out the valueless features, and finally, we utilize the gradient-guide strategy to update the template timely. Extensive experiments on four large tracking datasets, including VOT-2016, OTB100, GOT-10k, and UAV123, confirming SiamMLG satisfactorily balance performance and efficiency, where it scores 0.515 on GOT-10k while running at 55 frames per second, which is nearly 3.6 times that of the state-of-the-art method.
format Article
id doaj-art-c624cfbe8a7c497795143f3cefd3955e
institution Kabale University
issn 1687-5699
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series Advances in Multimedia
spelling doaj-art-c624cfbe8a7c497795143f3cefd3955e2025-02-03T01:30:04ZengWileyAdvances in Multimedia1687-56992024-01-01202410.1155/2024/9944425Fast Visual Tracking with Enhanced and Gradient-Guide NetworkDun Cao0Renhua Dai1School of Computer and Communication EngineeringSchool of Computer and Communication EngineeringThe existing Siamese trackers express visual tracking through the cross-correlation operation between two neural networks. Although they dominated the tracking field, their adopted pattern caused two main problems. One is the adoption of the deep architecture that drives the Siamese tracker to sacrifice speed for performance, and the other is that the template is fixed to the initial features; namely, the template cannot be updated timely, making performance entirely dependent on the Siamese network’s matching ability. In this work, we propose a tracker called SiamMLG. Firstly, we adopt the lightweight ResNet-34 as the backbone to improve the proposed tracker’s speed by reducing the computational complexity, and then, to compensate for the performance loss caused by the lightweight backbone, we embed the SKNet from the attention mechanism to filter out the valueless features, and finally, we utilize the gradient-guide strategy to update the template timely. Extensive experiments on four large tracking datasets, including VOT-2016, OTB100, GOT-10k, and UAV123, confirming SiamMLG satisfactorily balance performance and efficiency, where it scores 0.515 on GOT-10k while running at 55 frames per second, which is nearly 3.6 times that of the state-of-the-art method.http://dx.doi.org/10.1155/2024/9944425
spellingShingle Dun Cao
Renhua Dai
Fast Visual Tracking with Enhanced and Gradient-Guide Network
Advances in Multimedia
title Fast Visual Tracking with Enhanced and Gradient-Guide Network
title_full Fast Visual Tracking with Enhanced and Gradient-Guide Network
title_fullStr Fast Visual Tracking with Enhanced and Gradient-Guide Network
title_full_unstemmed Fast Visual Tracking with Enhanced and Gradient-Guide Network
title_short Fast Visual Tracking with Enhanced and Gradient-Guide Network
title_sort fast visual tracking with enhanced and gradient guide network
url http://dx.doi.org/10.1155/2024/9944425
work_keys_str_mv AT duncao fastvisualtrackingwithenhancedandgradientguidenetwork
AT renhuadai fastvisualtrackingwithenhancedandgradientguidenetwork