A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
Abstract Background Osteoporosis has become a significant public health concern that necessitates the application of appropriate techniques to calculate disease risk. Traditional methods, such as logistic regression,have been widely used to identify risk factors and predict disease probability. Howe...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
BMC
2025-03-01
|
| Series: | BMC Geriatrics |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s12877-025-05840-w |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849392606013816832 |
|---|---|
| author | Yuyi Peng Chi Zhang Bo Zhou |
| author_facet | Yuyi Peng Chi Zhang Bo Zhou |
| author_sort | Yuyi Peng |
| collection | DOAJ |
| description | Abstract Background Osteoporosis has become a significant public health concern that necessitates the application of appropriate techniques to calculate disease risk. Traditional methods, such as logistic regression,have been widely used to identify risk factors and predict disease probability. However,with the advent of advanced statistics techniques,machine learning models offer promising alternatives for improving prediction accuracy. What’s more, studies that use risk factors and prediction models for osteoporosis in high-risk groups for cardiovascular diseases are scarce. We aimed to explore the risk factors and disease probability of osteoporosis by comparing logistic regression with four machine learning models. By doing so,we seek to provide insights into the most effective methods for osteoporosis risk assessment and contribute to the development of tailored prevention strategies at high risk of cardiovascular disease among old adults. Methods We carried out a cross-sectional investigation of a high-risk group in cardiovascular patients. A logistic regression model and four common machine learning methods,DT,RF,SVM,and XGBoost were implemented to create a prediction model using information from 211 participants who met the inclusion requirements. Metrics for calibration and discrimination were used to compare the models. Results In total,211 patients were enrolled. The AUCs were 0.751 for the logistic regression model,0.72 for the SVM model,0.70 for the random forest model,0.697 for the model XGBoost,and 0.69 for the decision tree model. The logistic regression model outperforms other models for machine learning. According to the logistic regression model,there were nine predictors,including age,sex,glucose,TG (triglyceride),fracture history,stroke history,and CNV (copy number variation) nssv659422, and low-sodium salt. A well-calibrated result of 0.199 on the Brier scale. The findings of the internal validation demonstrated the high degree of repeatability of the prediction model employed in this study. Conclusions In this study, we discovered that when predicting osteoporosis,a number of machine learning techniques fell short of logistic regression. In a specific population, we have innovatively developed a risk prediction model for osteoporosis events that integrates genetic and environmental factors, is an effective tool for assessing osteoporosis risk and can serve as the basis for specialized intervention approaches. |
| format | Article |
| id | doaj-art-1864b51abc9c45c29df95c86aa89d1b8 |
| institution | Kabale University |
| issn | 1471-2318 |
| language | English |
| publishDate | 2025-03-01 |
| publisher | BMC |
| record_format | Article |
| series | BMC Geriatrics |
| spelling | doaj-art-1864b51abc9c45c29df95c86aa89d1b82025-08-20T03:40:44ZengBMCBMC Geriatrics1471-23182025-03-0125111510.1186/s12877-025-05840-wA cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adultsYuyi Peng0Chi Zhang1Bo Zhou2Department of Clinical Epidemiology and Evidence-Based Medicine, The First Hospital of China Medical UniversityDepartment of Orthopedics, The Fourth Affiliated Hospital, China Medical UniversityDepartment of Clinical Epidemiology and Evidence-Based Medicine, The First Hospital of China Medical UniversityAbstract Background Osteoporosis has become a significant public health concern that necessitates the application of appropriate techniques to calculate disease risk. Traditional methods, such as logistic regression,have been widely used to identify risk factors and predict disease probability. However,with the advent of advanced statistics techniques,machine learning models offer promising alternatives for improving prediction accuracy. What’s more, studies that use risk factors and prediction models for osteoporosis in high-risk groups for cardiovascular diseases are scarce. We aimed to explore the risk factors and disease probability of osteoporosis by comparing logistic regression with four machine learning models. By doing so,we seek to provide insights into the most effective methods for osteoporosis risk assessment and contribute to the development of tailored prevention strategies at high risk of cardiovascular disease among old adults. Methods We carried out a cross-sectional investigation of a high-risk group in cardiovascular patients. A logistic regression model and four common machine learning methods,DT,RF,SVM,and XGBoost were implemented to create a prediction model using information from 211 participants who met the inclusion requirements. Metrics for calibration and discrimination were used to compare the models. Results In total,211 patients were enrolled. The AUCs were 0.751 for the logistic regression model,0.72 for the SVM model,0.70 for the random forest model,0.697 for the model XGBoost,and 0.69 for the decision tree model. The logistic regression model outperforms other models for machine learning. According to the logistic regression model,there were nine predictors,including age,sex,glucose,TG (triglyceride),fracture history,stroke history,and CNV (copy number variation) nssv659422, and low-sodium salt. A well-calibrated result of 0.199 on the Brier scale. The findings of the internal validation demonstrated the high degree of repeatability of the prediction model employed in this study. Conclusions In this study, we discovered that when predicting osteoporosis,a number of machine learning techniques fell short of logistic regression. In a specific population, we have innovatively developed a risk prediction model for osteoporosis events that integrates genetic and environmental factors, is an effective tool for assessing osteoporosis risk and can serve as the basis for specialized intervention approaches.https://doi.org/10.1186/s12877-025-05840-wCopy number variantsOsteoporosisPrediction modelMachine learningLow sodium salt intake |
| spellingShingle | Yuyi Peng Chi Zhang Bo Zhou A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults BMC Geriatrics Copy number variants Osteoporosis Prediction model Machine learning Low sodium salt intake |
| title | A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults |
| title_full | A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults |
| title_fullStr | A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults |
| title_full_unstemmed | A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults |
| title_short | A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults |
| title_sort | cross sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults |
| topic | Copy number variants Osteoporosis Prediction model Machine learning Low sodium salt intake |
| url | https://doi.org/10.1186/s12877-025-05840-w |
| work_keys_str_mv | AT yuyipeng acrosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults AT chizhang acrosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults AT bozhou acrosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults AT yuyipeng crosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults AT chizhang crosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults AT bozhou crosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults |