MSCD-YOLO: A Lightweight Dense Pedestrian Detection Model with Finer-Grained Feature Information Interaction


Bibliographic Details
Main Authors: Qiang Liu, Zhongmin Li, Lei Zhang, Jin Deng
Format: Article
Language: English
Published: MDPI AG 2025-01-01
Series: Sensors
Subjects: deep learning; dense pedestrian detection; YOLOv8n; Mobile-ViT; SCNeck; DEHead
Online Access: https://www.mdpi.com/1424-8220/25/2/438
author Qiang Liu
Zhongmin Li
Lei Zhang
Jin Deng
collection DOAJ
description Pedestrian detection is widely used in real-time surveillance, urban traffic, and other fields. Dense pedestrian detection, a crucial direction within it, still faces many unresolved challenges: existing methods suffer from low detection accuracy, high miss rates, large parameter counts, and poor robustness. To address these issues, we propose MSCD-YOLO, a lightweight dense pedestrian detection model with finer-grained feature information interaction that achieves high accuracy, strong performance, and robustness with only a small number of parameters. The model uses the lightweight MobileViT backbone to reduce the parameter count while efficiently extracting both local and global features; the SCNeck neck network is designed to fuse the extracted features without losing information; and the DEHead detection head performs multi-scale feature fusion to detect the targets. To demonstrate the model's effectiveness, we conducted tests on two highly challenging dense pedestrian detection datasets, CrowdHuman and WiderPerson. Compared with the baseline model YOLOv8n, MSCD-YOLO improved mAP@0.5 by 4.6% and 1.8%, and mAP@0.5:0.95 by 5.3% and 2.6%, on the CrowdHuman and WiderPerson datasets, respectively. The experimental results show that, under the same experimental conditions, MSCD-YOLO significantly outperforms the original model in detection accuracy, efficiency, and model complexity.
format Article
id doaj-art-17cc77174e054364afe6735502e687a5
institution Kabale University
issn 1424-8220
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-17cc77174e054364afe6735502e687a5; 2025-01-24T13:48:56Z; eng; MDPI AG; Sensors; ISSN 1424-8220; 2025-01-01; vol. 25, no. 2, art. 438; doi:10.3390/s25020438
MSCD-YOLO: A Lightweight Dense Pedestrian Detection Model with Finer-Grained Feature Information Interaction
Qiang Liu, Zhongmin Li, Lei Zhang, Jin Deng (all: School of Information Engineering, Nanchang Hangkong University, Nanchang 330063, China)
https://www.mdpi.com/1424-8220/25/2/438
deep learning; dense pedestrian detection; YOLOv8n; Mobile-ViT; SCNeck; DEHead
title MSCD-YOLO: A Lightweight Dense Pedestrian Detection Model with Finer-Grained Feature Information Interaction
topic deep learning
dense pedestrian detection
YOLOv8n
Mobile-ViT
SCNeck
DEHead
url https://www.mdpi.com/1424-8220/25/2/438