Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine Learning

Traffic accidents occur frequently, causing significant losses to people’s lives and property safety. Accurately predicting the severity level of traffic accidents is of great significance. Based on traffic accident data, this study comprehensively considers various influencing factors such as the g...

Full description

Saved in:
Bibliographic Details
Main Authors: Jiming Tang, Yao Huang, Dingli Liu, Liuyuan Xiong, Rongwei Bu
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Systems
Subjects:
Online Access:https://www.mdpi.com/2079-8954/13/1/31
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832587469457981440
author Jiming Tang
Yao Huang
Dingli Liu
Liuyuan Xiong
Rongwei Bu
author_facet Jiming Tang
Yao Huang
Dingli Liu
Liuyuan Xiong
Rongwei Bu
author_sort Jiming Tang
collection DOAJ
description Traffic accidents occur frequently, causing significant losses to people’s lives and property safety. Accurately predicting the severity level of traffic accidents is of great significance. Based on traffic accident data, this study comprehensively considers various influencing factors such as the geographical location, road conditions, and environment. The data are divided into accident-related categories, weather-related categories, and road- and environment-related categories. The machine learning method is improved through integration for the accident level prediction. In the experiment, effective preprocessing measures were taken for problems such as data imbalance, missing values, the encoding of categorical variables, and the standardization of numerical features. The unbalanced distribution of “Severity” was improved through under-sampling and over-sampling techniques. Firstly, we adopted a multi-stage fusion strategy. A multi-layer perceptron (MLP) was used for the preliminary prediction, and then its result was combined with the original features to form a new feature. Decision tree, XGBoost, and random forest algorithms, respectively, were applied for the secondary prediction. The analysis results show that the improved machine learning model is significantly superior to a single model in the overall performance. The “MLP + random forest” model performs well in evaluation indicators such as the accuracy, recall rate, and F1 value. The accuracy rate is as high as 94%. In the prediction of different traffic accident severity levels (minor, moderate, and severe), the improved machine learning model also generally shows better performance and stability. The research results of this study have broad prospects in the field of intelligent driving. It can realize real-time accident prediction and early warnings, and provide decision support for drivers and autonomous driving systems. The research also provides a scientific basis for traffic planning and management departments to improve driving conditions and reduce the probability and losses of traffic accidents.
format Article
id doaj-art-f63c7a50ebb34004aecad186c76e52b8
institution Kabale University
issn 2079-8954
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Systems
spelling doaj-art-f63c7a50ebb34004aecad186c76e52b82025-01-24T13:50:33ZengMDPI AGSystems2079-89542025-01-011313110.3390/systems13010031Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine LearningJiming Tang0Yao Huang1Dingli Liu2Liuyuan Xiong3Rongwei Bu4School of Traffic and Transportation Engineering, Engineering Research Center of Catastrophic Prophylaxis and Treatment of Road & Traffic Safety of Ministry of Education, Changsha University of Science and Technology, Changsha 410114, ChinaSchool of Traffic and Transportation Engineering, Engineering Research Center of Catastrophic Prophylaxis and Treatment of Road & Traffic Safety of Ministry of Education, Changsha University of Science and Technology, Changsha 410114, ChinaSchool of Traffic and Transportation Engineering, Engineering Research Center of Catastrophic Prophylaxis and Treatment of Road & Traffic Safety of Ministry of Education, Changsha University of Science and Technology, Changsha 410114, ChinaSchool of Traffic and Transportation Engineering, Engineering Research Center of Catastrophic Prophylaxis and Treatment of Road & Traffic Safety of Ministry of Education, Changsha University of Science and Technology, Changsha 410114, ChinaSchool of Traffic and Transportation Engineering, Engineering Research Center of Catastrophic Prophylaxis and Treatment of Road & Traffic Safety of Ministry of Education, Changsha University of Science and Technology, Changsha 410114, ChinaTraffic accidents occur frequently, causing significant losses to people’s lives and property safety. Accurately predicting the severity level of traffic accidents is of great significance. Based on traffic accident data, this study comprehensively considers various influencing factors such as the geographical location, road conditions, and environment. The data are divided into accident-related categories, weather-related categories, and road- and environment-related categories. The machine learning method is improved through integration for the accident level prediction. In the experiment, effective preprocessing measures were taken for problems such as data imbalance, missing values, the encoding of categorical variables, and the standardization of numerical features. The unbalanced distribution of “Severity” was improved through under-sampling and over-sampling techniques. Firstly, we adopted a multi-stage fusion strategy. A multi-layer perceptron (MLP) was used for the preliminary prediction, and then its result was combined with the original features to form a new feature. Decision tree, XGBoost, and random forest algorithms, respectively, were applied for the secondary prediction. The analysis results show that the improved machine learning model is significantly superior to a single model in the overall performance. The “MLP + random forest” model performs well in evaluation indicators such as the accuracy, recall rate, and F1 value. The accuracy rate is as high as 94%. In the prediction of different traffic accident severity levels (minor, moderate, and severe), the improved machine learning model also generally shows better performance and stability. The research results of this study have broad prospects in the field of intelligent driving. It can realize real-time accident prediction and early warnings, and provide decision support for drivers and autonomous driving systems. The research also provides a scientific basis for traffic planning and management departments to improve driving conditions and reduce the probability and losses of traffic accidents.https://www.mdpi.com/2079-8954/13/1/31traffic accidentseverityimproved machine learningprediction modelsampling technique
spellingShingle Jiming Tang
Yao Huang
Dingli Liu
Liuyuan Xiong
Rongwei Bu
Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine Learning
Systems
traffic accident
severity
improved machine learning
prediction model
sampling technique
title Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine Learning
title_full Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine Learning
title_fullStr Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine Learning
title_full_unstemmed Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine Learning
title_short Research on Traffic Accident Severity Level Prediction Model Based on Improved Machine Learning
title_sort research on traffic accident severity level prediction model based on improved machine learning
topic traffic accident
severity
improved machine learning
prediction model
sampling technique
url https://www.mdpi.com/2079-8954/13/1/31
work_keys_str_mv AT jimingtang researchontrafficaccidentseveritylevelpredictionmodelbasedonimprovedmachinelearning
AT yaohuang researchontrafficaccidentseveritylevelpredictionmodelbasedonimprovedmachinelearning
AT dingliliu researchontrafficaccidentseveritylevelpredictionmodelbasedonimprovedmachinelearning
AT liuyuanxiong researchontrafficaccidentseveritylevelpredictionmodelbasedonimprovedmachinelearning
AT rongweibu researchontrafficaccidentseveritylevelpredictionmodelbasedonimprovedmachinelearning