Improved object detection method for autonomous driving based on DETR

Object detection is a critical component in the development of autonomous driving technology and has demonstrated significant growth potential. To address the limitations of current techniques, this paper presents an improved object detection method for autonomous driving based on a detection transf...

Full description

Saved in:
Bibliographic Details
Main Authors: Huaqi Zhao, Songnan Zhang, Xiang Peng, Zhengguang Lu, Guojing Li
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Neurorobotics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fnbot.2024.1484276/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832593903147024384
author Huaqi Zhao
Songnan Zhang
Xiang Peng
Zhengguang Lu
Guojing Li
author_facet Huaqi Zhao
Songnan Zhang
Xiang Peng
Zhengguang Lu
Guojing Li
author_sort Huaqi Zhao
collection DOAJ
description Object detection is a critical component in the development of autonomous driving technology and has demonstrated significant growth potential. To address the limitations of current techniques, this paper presents an improved object detection method for autonomous driving based on a detection transformer (DETR). First, we introduce a multi-scale feature and location information extraction method, which solves the inadequacy of the model for multi-scale object localization and detection. In addition, we developed a transformer encoder based on the group axial attention mechanism. This allows for efficient attention range control in the horizontal and vertical directions while reducing computation, ultimately enhancing the inference speed. Furthermore, we propose a novel dynamic hyperparameter tuning training method based on Pareto efficiency, which coordinates the training state of the loss functions through dynamic weights, overcoming issues associated with manually setting fixed weights and enhancing model convergence speed and accuracy. Experimental results demonstrate that the proposed method surpasses others, with improvements of 3.3%, 4.5%, and 3% in average precision on the COCO, PASCAL VOC, and KITTI datasets, respectively, and an 84% increase in FPS.
format Article
id doaj-art-003ac97d0f3f4f6fa76838e7f2a37565
institution Kabale University
issn 1662-5218
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Neurorobotics
spelling doaj-art-003ac97d0f3f4f6fa76838e7f2a375652025-01-20T07:20:10ZengFrontiers Media S.A.Frontiers in Neurorobotics1662-52182025-01-011810.3389/fnbot.2024.14842761484276Improved object detection method for autonomous driving based on DETRHuaqi Zhao0Songnan Zhang1Xiang Peng2Zhengguang Lu3Guojing Li4The Heilongjiang Provincial Key Laboratory of Autonomous Intelligence and Information Processing, School of Information and Electronic Technology, Jiamusi University, Jiamusi, ChinaThe Heilongjiang Provincial Key Laboratory of Autonomous Intelligence and Information Processing, School of Information and Electronic Technology, Jiamusi University, Jiamusi, ChinaThe Heilongjiang Provincial Key Laboratory of Autonomous Intelligence and Information Processing, School of Information and Electronic Technology, Jiamusi University, Jiamusi, ChinaThe Heilongjiang Provincial Key Laboratory of Autonomous Intelligence and Information Processing, School of Information and Electronic Technology, Jiamusi University, Jiamusi, ChinaSchool of Materials Science and Engineering, Jiamusi University, Jiamusi, ChinaObject detection is a critical component in the development of autonomous driving technology and has demonstrated significant growth potential. To address the limitations of current techniques, this paper presents an improved object detection method for autonomous driving based on a detection transformer (DETR). First, we introduce a multi-scale feature and location information extraction method, which solves the inadequacy of the model for multi-scale object localization and detection. In addition, we developed a transformer encoder based on the group axial attention mechanism. This allows for efficient attention range control in the horizontal and vertical directions while reducing computation, ultimately enhancing the inference speed. Furthermore, we propose a novel dynamic hyperparameter tuning training method based on Pareto efficiency, which coordinates the training state of the loss functions through dynamic weights, overcoming issues associated with manually setting fixed weights and enhancing model convergence speed and accuracy. Experimental results demonstrate that the proposed method surpasses others, with improvements of 3.3%, 4.5%, and 3% in average precision on the COCO, PASCAL VOC, and KITTI datasets, respectively, and an 84% increase in FPS.https://www.frontiersin.org/articles/10.3389/fnbot.2024.1484276/fullobject detectionfeature extractiontransformer encoderloss functionparameter tuning
spellingShingle Huaqi Zhao
Songnan Zhang
Xiang Peng
Zhengguang Lu
Guojing Li
Improved object detection method for autonomous driving based on DETR
Frontiers in Neurorobotics
object detection
feature extraction
transformer encoder
loss function
parameter tuning
title Improved object detection method for autonomous driving based on DETR
title_full Improved object detection method for autonomous driving based on DETR
title_fullStr Improved object detection method for autonomous driving based on DETR
title_full_unstemmed Improved object detection method for autonomous driving based on DETR
title_short Improved object detection method for autonomous driving based on DETR
title_sort improved object detection method for autonomous driving based on detr
topic object detection
feature extraction
transformer encoder
loss function
parameter tuning
url https://www.frontiersin.org/articles/10.3389/fnbot.2024.1484276/full
work_keys_str_mv AT huaqizhao improvedobjectdetectionmethodforautonomousdrivingbasedondetr
AT songnanzhang improvedobjectdetectionmethodforautonomousdrivingbasedondetr
AT xiangpeng improvedobjectdetectionmethodforautonomousdrivingbasedondetr
AT zhengguanglu improvedobjectdetectionmethodforautonomousdrivingbasedondetr
AT guojingli improvedobjectdetectionmethodforautonomousdrivingbasedondetr