Development and validation of an early predictive model for hemiplegic shoulder pain: a comparative study of logistic regression, support vector machine, and random forest

Bibliographic Details
Main Authors: Qiang Wu, Fang Zhang, Yuchang Fei, Zhenfen Sima, Shanshan Gong, Qifeng Tong, Qingchuan Jiao, Hao Wu, Jianqiu Gong
Format: Article
Language: English
Published: Frontiers Media S.A. 2025-06-01
Series: Frontiers in Neurology
Online Access: https://www.frontiersin.org/articles/10.3389/fneur.2025.1612222/full
Description
Summary:
Objective: This study aimed to identify predictive variables for hemiplegic shoulder pain (HSP) using machine learning algorithms, select the optimal model, and predict the occurrence of HSP.
Methods: Data were collected from 332 stroke patients admitted to a tertiary hospital in Zhejiang Province from January 2022 to January 2023. After predictive variables were screened by LASSO regression, three predictive models selected using the LazyPredict package were established: logistic regression (LR), support vector machine (SVM), and random forest (RF). Performance metrics (accuracy, precision, recall, and F1 score) were calculated, and receiver operating characteristic (ROC) curves and decision curve analysis (DCA) were plotted to compare the three models. An explainability analysis using SHAP was conducted on the optimal model.
Results: The RF model performed best, with accuracy 0.90, precision 0.89, recall 0.88, F1 score 0.86, and AUC-ROC 0.94; the threshold probability range in DCA was 7% to 99%. According to the SHAP analysis of the RF model, the early HSP predictive variables, ranked from highest to lowest contribution, were: multiple injuries, shoulder joint flexion (p), biceps tendon effusion, sensory disorder, supraspinatus tendinopathy, subluxation, diabetes, and age.
Conclusion: The RF prediction model predicts HSP well and offers good clinical explainability. It can provide an objective reference for the early warning and stratified management of HSP.
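The workflow in the summary (LASSO screening, then LR, SVM, and RF compared on accuracy, precision, recall, F1 score, and AUC, plus decision curve analysis) can be illustrated with a minimal scikit-learn sketch. This is not the authors' code. The data are synthetic (only the row count of 332 echoes the cohort size), and the 70/30 split, the hyperparameters, the use of `LassoCV` inside `SelectFromModel` as the screening step, and the helper names `net_benefit` and `compare_models` are all assumptions made for illustration. The LazyPredict selection and SHAP steps are omitted.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectFromModel
from sklearn.linear_model import LassoCV, LogisticRegression
from sklearn.metrics import (accuracy_score, f1_score, precision_score,
                             recall_score, roc_auc_score)
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def net_benefit(y_true, proba, threshold):
    """Net benefit at one threshold probability (the quantity plotted in DCA)."""
    y_true = np.asarray(y_true)
    treat = np.asarray(proba) >= threshold  # patients flagged as high risk
    n = len(y_true)
    tp = np.sum(treat & (y_true == 1))
    fp = np.sum(treat & (y_true == 0))
    # False positives are weighted by the odds of the threshold probability.
    return tp / n - fp / n * threshold / (1 - threshold)


def compare_models(seed=0):
    """Fit LR, SVM, and RF after LASSO screening; return test-set metrics."""
    # Synthetic stand-in data (assumption): not the study's clinical variables.
    X, y = make_classification(n_samples=332, n_features=20,
                               n_informative=8, random_state=seed)
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=seed)

    models = {
        "LR": LogisticRegression(max_iter=1000),
        "SVM": SVC(probability=True, random_state=seed),
        "RF": RandomForestClassifier(n_estimators=200, random_state=seed),
    }
    results = {}
    for name, clf in models.items():
        # Screening is refit inside each pipeline so it only sees training data.
        pipe = make_pipeline(
            StandardScaler(),
            SelectFromModel(LassoCV(cv=5, random_state=seed)),
            clf,
        )
        pipe.fit(X_tr, y_tr)
        pred = pipe.predict(X_te)
        proba = pipe.predict_proba(X_te)[:, 1]
        results[name] = {
            "accuracy": accuracy_score(y_te, pred),
            "precision": precision_score(y_te, pred),
            "recall": recall_score(y_te, pred),
            "f1": f1_score(y_te, pred),
            "auc": roc_auc_score(y_te, proba),
            "net_benefit_at_0.30": net_benefit(y_te, proba, 0.30),
        }
    return results


results = compare_models()
for name, metrics in results.items():
    print(name, {k: round(float(v), 3) for k, v in metrics.items()})
```

Evaluating `net_benefit` over a grid of thresholds and plotting it against the treat-all and treat-none strategies gives a decision curve. The range of thresholds over which the model's curve lies above both strategies corresponds to the kind of threshold probability range reported in the summary.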
ISSN:1664-2295