A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults

Abstract Background Osteoporosis has become a significant public health concern that necessitates the application of appropriate techniques to calculate disease risk. Traditional methods, such as logistic regression,have been widely used to identify risk factors and predict disease probability. Howe...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuyi Peng, Chi Zhang, Bo Zhou
Format: Article
Language:English
Published: BMC 2025-03-01
Series:BMC Geriatrics
Subjects:
Online Access:https://doi.org/10.1186/s12877-025-05840-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849392606013816832
author Yuyi Peng
Chi Zhang
Bo Zhou
author_facet Yuyi Peng
Chi Zhang
Bo Zhou
author_sort Yuyi Peng
collection DOAJ
description Abstract Background Osteoporosis has become a significant public health concern that necessitates the application of appropriate techniques to calculate disease risk. Traditional methods, such as logistic regression,have been widely used to identify risk factors and predict disease probability. However,with the advent of advanced statistics techniques,machine learning models offer promising alternatives for improving prediction accuracy. What’s more, studies that use risk factors and prediction models for osteoporosis in high-risk groups for cardiovascular diseases are scarce. We aimed to explore the risk factors and disease probability of osteoporosis by comparing logistic regression with four machine learning models. By doing so,we seek to provide insights into the most effective methods for osteoporosis risk assessment and contribute to the development of tailored prevention strategies at high risk of cardiovascular disease among old adults. Methods We carried out a cross-sectional investigation of a high-risk group in cardiovascular patients. A logistic regression model and four common machine learning methods,DT,RF,SVM,and XGBoost were implemented to create a prediction model using information from 211 participants who met the inclusion requirements. Metrics for calibration and discrimination were used to compare the models. Results In total,211 patients were enrolled. The AUCs were 0.751 for the logistic regression model,0.72 for the SVM model,0.70 for the random forest model,0.697 for the model XGBoost,and 0.69 for the decision tree model. The logistic regression model outperforms other models for machine learning. According to the logistic regression model,there were nine predictors,including age,sex,glucose,TG (triglyceride),fracture history,stroke history,and CNV (copy number variation) nssv659422, and low-sodium salt. A well-calibrated result of 0.199 on the Brier scale. The findings of the internal validation demonstrated the high degree of repeatability of the prediction model employed in this study. Conclusions In this study, we discovered that when predicting osteoporosis,a number of machine learning techniques fell short of logistic regression. In a specific population, we have innovatively developed a risk prediction model for osteoporosis events that integrates genetic and environmental factors, is an effective tool for assessing osteoporosis risk and can serve as the basis for specialized intervention approaches.
format Article
id doaj-art-1864b51abc9c45c29df95c86aa89d1b8
institution Kabale University
issn 1471-2318
language English
publishDate 2025-03-01
publisher BMC
record_format Article
series BMC Geriatrics
spelling doaj-art-1864b51abc9c45c29df95c86aa89d1b82025-08-20T03:40:44ZengBMCBMC Geriatrics1471-23182025-03-0125111510.1186/s12877-025-05840-wA cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adultsYuyi Peng0Chi Zhang1Bo Zhou2Department of Clinical Epidemiology and Evidence-Based Medicine, The First Hospital of China Medical UniversityDepartment of Orthopedics, The Fourth Affiliated Hospital, China Medical UniversityDepartment of Clinical Epidemiology and Evidence-Based Medicine, The First Hospital of China Medical UniversityAbstract Background Osteoporosis has become a significant public health concern that necessitates the application of appropriate techniques to calculate disease risk. Traditional methods, such as logistic regression,have been widely used to identify risk factors and predict disease probability. However,with the advent of advanced statistics techniques,machine learning models offer promising alternatives for improving prediction accuracy. What’s more, studies that use risk factors and prediction models for osteoporosis in high-risk groups for cardiovascular diseases are scarce. We aimed to explore the risk factors and disease probability of osteoporosis by comparing logistic regression with four machine learning models. By doing so,we seek to provide insights into the most effective methods for osteoporosis risk assessment and contribute to the development of tailored prevention strategies at high risk of cardiovascular disease among old adults. Methods We carried out a cross-sectional investigation of a high-risk group in cardiovascular patients. A logistic regression model and four common machine learning methods,DT,RF,SVM,and XGBoost were implemented to create a prediction model using information from 211 participants who met the inclusion requirements. Metrics for calibration and discrimination were used to compare the models. Results In total,211 patients were enrolled. The AUCs were 0.751 for the logistic regression model,0.72 for the SVM model,0.70 for the random forest model,0.697 for the model XGBoost,and 0.69 for the decision tree model. The logistic regression model outperforms other models for machine learning. According to the logistic regression model,there were nine predictors,including age,sex,glucose,TG (triglyceride),fracture history,stroke history,and CNV (copy number variation) nssv659422, and low-sodium salt. A well-calibrated result of 0.199 on the Brier scale. The findings of the internal validation demonstrated the high degree of repeatability of the prediction model employed in this study. Conclusions In this study, we discovered that when predicting osteoporosis,a number of machine learning techniques fell short of logistic regression. In a specific population, we have innovatively developed a risk prediction model for osteoporosis events that integrates genetic and environmental factors, is an effective tool for assessing osteoporosis risk and can serve as the basis for specialized intervention approaches.https://doi.org/10.1186/s12877-025-05840-wCopy number variantsOsteoporosisPrediction modelMachine learningLow sodium salt intake
spellingShingle Yuyi Peng
Chi Zhang
Bo Zhou
A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
BMC Geriatrics
Copy number variants
Osteoporosis
Prediction model
Machine learning
Low sodium salt intake
title A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
title_full A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
title_fullStr A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
title_full_unstemmed A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
title_short A cross-sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
title_sort cross sectional study comparing machine learning and logistic regression techniques for predicting osteoporosis in a group at high risk of cardiovascular disease among old adults
topic Copy number variants
Osteoporosis
Prediction model
Machine learning
Low sodium salt intake
url https://doi.org/10.1186/s12877-025-05840-w
work_keys_str_mv AT yuyipeng acrosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults
AT chizhang acrosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults
AT bozhou acrosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults
AT yuyipeng crosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults
AT chizhang crosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults
AT bozhou crosssectionalstudycomparingmachinelearningandlogisticregressiontechniquesforpredictingosteoporosisinagroupathighriskofcardiovasculardiseaseamongoldadults