Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm

BackgroundThe Department of Rehabilitation Medicine is key to improving patients’ quality of life. Driven by chronic diseases and an aging population, there is a need to enhance the efficiency and resource allocation of outpatient facilities. This study aims to analyze the treatment preferences of o...

Full description

Saved in:
Bibliographic Details
Main Authors: Xuehui Fan, Ruixue Ye, Yan Gao, Kaiwen Xue, Zeyu Zhang, Jing Xu, Jingpu Zhao, Jun Feng, Yulong Wang
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Artificial Intelligence
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frai.2024.1473837/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841533602712518656
author Xuehui Fan
Ruixue Ye
Yan Gao
Kaiwen Xue
Zeyu Zhang
Jing Xu
Jingpu Zhao
Jun Feng
Yulong Wang
author_facet Xuehui Fan
Ruixue Ye
Yan Gao
Kaiwen Xue
Zeyu Zhang
Jing Xu
Jingpu Zhao
Jun Feng
Yulong Wang
author_sort Xuehui Fan
collection DOAJ
description BackgroundThe Department of Rehabilitation Medicine is key to improving patients’ quality of life. Driven by chronic diseases and an aging population, there is a need to enhance the efficiency and resource allocation of outpatient facilities. This study aims to analyze the treatment preferences of outpatient rehabilitation patients by using data and a grading tool to establish predictive models. The goal is to improve patient visit efficiency and optimize resource allocation through these predictive models.MethodsData were collected from 38 Chinese institutions, including 4,244 patients visiting outpatient rehabilitation clinics. Data processing was conducted using Python software. The pandas library was used for data cleaning and preprocessing, involving 68 categorical and 12 continuous variables. The steps included handling missing values, data normalization, and encoding conversion. The data were divided into 80% training and 20% test sets using the Scikit-learn library to ensure model independence and prevent overfitting. Performance comparisons among XGBoost, random forest, and logistic regression were conducted using metrics, including accuracy and receiver operating characteristic (ROC) curves. The imbalanced learning library’s SMOTE technique was used to address the sample imbalance during model training. The model was optimized using a confusion matrix and feature importance analysis, and partial dependence plots (PDP) were used to analyze the key influencing factors.ResultsXGBoost achieved the highest overall accuracy of 80.21% with high precision and recall in Category 1. random forest showed a similar overall accuracy. Logistic Regression had a significantly lower accuracy, indicating difficulties with nonlinear data. The key influencing factors identified include distance to medical institutions, arrival time, length of hospital stay, and specific diseases, such as cardiovascular, pulmonary, oncological, and orthopedic conditions. The tiered diagnosis and treatment tool effectively helped doctors assess patients’ conditions and recommend suitable medical institutions based on rehabilitation grading.ConclusionThis study confirmed that ensemble learning methods, particularly XGBoost, outperform single models in classification tasks involving complex datasets. Addressing class imbalance and enhancing feature engineering can further improve model performance. Understanding patient preferences and the factors influencing medical institution selection can guide healthcare policies to optimize resource allocation, improve service quality, and enhance patient satisfaction. Tiered diagnosis and treatment tools play a crucial role in helping doctors evaluate patient conditions and make informed recommendations for appropriate medical care.
format Article
id doaj-art-215414ace2924f47875327790a4e782b
institution Kabale University
issn 2624-8212
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Artificial Intelligence
spelling doaj-art-215414ace2924f47875327790a4e782b2025-01-15T14:27:13ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122025-01-01710.3389/frai.2024.14738371473837Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithmXuehui Fan0Ruixue Ye1Yan Gao2Kaiwen Xue3Zeyu Zhang4Jing Xu5Jingpu Zhao6Jun Feng7Yulong Wang8Department of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaDepartment of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaDepartment of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaDepartment of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaDepartment of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaDepartment of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaDepartment of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaLinping Hospital of Integrated Traditional Chinese and Western, Medicine, Hangzhou, Zhejiang, ChinaDepartment of Rehabilitation Medicine, The First Affiliated Hospital of Shenzhen University, The Second People’s Hospital of Shenzhen, Shenzhen, Guangdong, ChinaBackgroundThe Department of Rehabilitation Medicine is key to improving patients’ quality of life. Driven by chronic diseases and an aging population, there is a need to enhance the efficiency and resource allocation of outpatient facilities. This study aims to analyze the treatment preferences of outpatient rehabilitation patients by using data and a grading tool to establish predictive models. The goal is to improve patient visit efficiency and optimize resource allocation through these predictive models.MethodsData were collected from 38 Chinese institutions, including 4,244 patients visiting outpatient rehabilitation clinics. Data processing was conducted using Python software. The pandas library was used for data cleaning and preprocessing, involving 68 categorical and 12 continuous variables. The steps included handling missing values, data normalization, and encoding conversion. The data were divided into 80% training and 20% test sets using the Scikit-learn library to ensure model independence and prevent overfitting. Performance comparisons among XGBoost, random forest, and logistic regression were conducted using metrics, including accuracy and receiver operating characteristic (ROC) curves. The imbalanced learning library’s SMOTE technique was used to address the sample imbalance during model training. The model was optimized using a confusion matrix and feature importance analysis, and partial dependence plots (PDP) were used to analyze the key influencing factors.ResultsXGBoost achieved the highest overall accuracy of 80.21% with high precision and recall in Category 1. random forest showed a similar overall accuracy. Logistic Regression had a significantly lower accuracy, indicating difficulties with nonlinear data. The key influencing factors identified include distance to medical institutions, arrival time, length of hospital stay, and specific diseases, such as cardiovascular, pulmonary, oncological, and orthopedic conditions. The tiered diagnosis and treatment tool effectively helped doctors assess patients’ conditions and recommend suitable medical institutions based on rehabilitation grading.ConclusionThis study confirmed that ensemble learning methods, particularly XGBoost, outperform single models in classification tasks involving complex datasets. Addressing class imbalance and enhancing feature engineering can further improve model performance. Understanding patient preferences and the factors influencing medical institution selection can guide healthcare policies to optimize resource allocation, improve service quality, and enhance patient satisfaction. Tiered diagnosis and treatment tools play a crucial role in helping doctors evaluate patient conditions and make informed recommendations for appropriate medical care.https://www.frontiersin.org/articles/10.3389/frai.2024.1473837/fullXGBoostmachine learning algorithmrehabilitation patientgraded diagnosis and treatmenttreatment preferences
spellingShingle Xuehui Fan
Ruixue Ye
Yan Gao
Kaiwen Xue
Zeyu Zhang
Jing Xu
Jingpu Zhao
Jun Feng
Yulong Wang
Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm
Frontiers in Artificial Intelligence
XGBoost
machine learning algorithm
rehabilitation patient
graded diagnosis and treatment
treatment preferences
title Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm
title_full Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm
title_fullStr Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm
title_full_unstemmed Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm
title_short Prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on XGBoost machine learning algorithm
title_sort prediction of outpatient rehabilitation patient preferences and optimization of graded diagnosis and treatment based on xgboost machine learning algorithm
topic XGBoost
machine learning algorithm
rehabilitation patient
graded diagnosis and treatment
treatment preferences
url https://www.frontiersin.org/articles/10.3389/frai.2024.1473837/full
work_keys_str_mv AT xuehuifan predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT ruixueye predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT yangao predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT kaiwenxue predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT zeyuzhang predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT jingxu predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT jingpuzhao predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT junfeng predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm
AT yulongwang predictionofoutpatientrehabilitationpatientpreferencesandoptimizationofgradeddiagnosisandtreatmentbasedonxgboostmachinelearningalgorithm