Establishment of predictive models for postoperative delirium in elderly patients after knee/hip surgery based on total bilirubin concentration: machine learning algorithms

Abstract Background With the aging demographic on the rise, we’re seeing a spike in the occurrence of postoperative delirium (POD). Our research aims to delve into the connection between plasma bilirubin levels and postoperative delirium, with the goal of crafting ten machine learning (ML) models ca...

Full description

Saved in:
Bibliographic Details
Main Authors: Shuhui Hua, Chuan Li, Yuanlong Wang, YiZhi Liang, Shanling Xu, Jian Kong, Hongyan Gong, Rui Dong, Yanan Lin, Xu Lin, Yanlin Bi, Bin Wang
Format: Article
Language:English
Published: BMC 2025-07-01
Series:BMC Anesthesiology
Subjects:
Online Access:https://doi.org/10.1186/s12871-025-03259-9
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Background With the aging demographic on the rise, we’re seeing a spike in the occurrence of postoperative delirium (POD). Our research aims to delve into the connection between plasma bilirubin levels and postoperative delirium, with the goal of crafting ten machine learning (ML) models capable of predicting POD instances. Methods This study enrolled 621 elderly patients after knee/hip surgery. We used the Confusion Assessment Method (CAM) to assess whether participants had POD. Univariate binary logistic regression analysis and restricted cubic spline (RCS) analysis were used to evaluate the association between plasma total bilirubin and POD. This study further investigated whether cerebrospinal fluid plays some role in the relationship between bilirubin and POD using mediated causal analysis. Subsequently, we employed ten machine learning algorithms to train and develop the predictive models: Logistic Regression (LR), Support Vector Machine (SVM), Gradient Boosting Model (GBM), Neural Network (NN), Random Forest (RF), Xgboost, K-Nearest Neighbors (KNN), AdaBoost, LightGBM, and CatBoost. The performance of the models was evaluated by the area under the receiver operating characteristic curve (AUROC), Brier score, accuracy, sensitivity, specificity, precision, F1 score, calibration curve, decision curve, clinical impact curve, and confusion matrix. In addition, the model was interpreted through Shapley additive interpretation (SHAP) analysis to clarify the importance of bilirubin in the model and its decision-making basis. Results Univariate binary logistic regression analysis revealed that plasma total bilirubin was associated with POD. Furthermore, the RCS analysis illustrated there was no nonlinear relationship between total bilirubin and POD. Mediation analysis indicted that T-tau mediated the effect of total bilirubin on POD. Total bilirubin and other features(age, educational level, BMI, history of diabetes, ASA, albumin, Aβ42, T-tau and P-tau) were used to construct ML models. Compared with other ML algorithms, NN showed better performance, with an AUC of 0.973 (95% CI (0.959–0.987)) in the test set. In addition, the SHAP method determines that age and education are the main determinants that affect the prediction of ML models. Conclusion Plasma total bilirubin was identified as a preoperative risk factor for postoperative delirium (POD). Among ten ML models, the Neural Network (NN) incorporating total bilirubin showed the best predictive performance for POD. Trial registration Clinical Registration No. ChiCTR2000033439. Registration data:2020.06.01.
ISSN:1471-2253