Constructing a fall risk prediction model for hospitalized patients using machine learning

Abstract Study objectives This study aimed to identify the risk factors associated with falls in hospitalized patients, develop a predictive risk model using machine learning algorithms, and evaluate the validity of the model’s predictions. Study design A cross-sectional design was employed using da...

Full description

Saved in:
Bibliographic Details
Main Authors: Cheng-Wei Kang, Zhao-Kui Yan, Jia-Liang Tian, Xiao-Bing Pu, Li-Xue Wu
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Public Health
Subjects:
Online Access:https://doi.org/10.1186/s12889-025-21284-8
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Study objectives This study aimed to identify the risk factors associated with falls in hospitalized patients, develop a predictive risk model using machine learning algorithms, and evaluate the validity of the model’s predictions. Study design A cross-sectional design was employed using data from the DRYAD public database. Research methods The study utilized data from the Fukushima Medical University Hospital Cohort Study, obtained from the DRYAD public database. 20% of the dataset was allocated as an independent test set, while the remaining 80% was utilized for training and validation. To address data imbalance in binary variables, the Synthetic Minority Oversampling Technique combined with Edited Nearest Neighbors (SMOTE-ENN) was applied. Univariate analysis and least absolute shrinkage and selection operator (LASSO) regression were used to analyze and screen variables. Predictive models were constructed by integrating key clinical features, and eight machine learning algorithms were evaluated to identify the most effective model. Additionally, SHAP (Shapley Additive Explanations) was used to interpret the predictive models and rank the importance of risk factors. Results The final model included the following variables: Adl_standing, Adl_evacuation, Age_group, Planned_surgery, Wheelchair, History_of_falls, Hypnotic_drugs, Psychotropic_drugs, and Remote_caring_system. Among the evaluated models, the Random Forest algorithm demonstrated superior performance, achieving an AUC of 0.814 (95% CI: 0.802–0.827) in the training set, 0.781 (95% CI: 0.740–0.821) in the validation set, and 0.795 (95% CI: 0.770–0.820) in the test set. Conclusion Machine learning algorithms, particularly Random Forest, are effective in predicting fall risk among hospitalized patients. These findings can significantly enhance fall prevention strategies within healthcare settings.
ISSN:1471-2458