Machine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patients

Abstract Objective Endometrial lesions are a frequent complication following breast cancer, and current diagnostic tools have limitations. This study aims to develop a machine learning-based nomogram model for predicting the early detection of endometrial lesions in patients. The model is designed t...

Full description

Saved in:
Bibliographic Details
Main Authors: Cao Shaoshan, Chen Niannian, Ma Ying
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-82373-z
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544756160626688
author Cao Shaoshan
Chen Niannian
Ma Ying
author_facet Cao Shaoshan
Chen Niannian
Ma Ying
author_sort Cao Shaoshan
collection DOAJ
description Abstract Objective Endometrial lesions are a frequent complication following breast cancer, and current diagnostic tools have limitations. This study aims to develop a machine learning-based nomogram model for predicting the early detection of endometrial lesions in patients. The model is designed to assess risk and facilitate individualized treatment strategies for premenopausal breast cancer patients. Method A retrospective study was conducted on 224 patients who underwent diagnostic curettage post-tamoxifen (TAM) therapy between November 2012 and November 2023. These patients exhibited signs of endometrial abnormalities or symptoms such as colporrhagia. Clinical data were collected and analyzed using R software (version 4.3.2) to identify factors influencing the occurrence of endometrial lesions and evaluate their predictive values. Three machine learning methods were employed to develop a risk prediction model, and their performances were compared. The best-performing model was selected to construct a nomogram of endometrial lesions. Internal validation was conducted using the bootstrap method, and the model’s accuracy and fit were assessed using the concordance index (C-index) and calibration curves. Results Independent risk factors for endometrial lesions included ultrasound characteristics, duration of TAM therapy, presence of colporrhagia, and endometrial thickness (P < 0.05). Among the machine learning methods compared, the LASSO regression integrated with a multifactorial logistic regression model demonstrated strong performance, with a concordance index (C-index) of 0.874 and effective calibration (mean absolute error of conformity: 0.014). This model achieved an accuracy of 0.853 and a precision of 0.917, with a training set AUC of 0.874 (95% CI: 0.794–0.831) and a test set AUC of 0.891 (95% CI: 0.777-1.000), closely aligning the predicted risk with the actual observed risk. Conclusion The developed prediction model is effective in evaluating endometrial lesions in premenopausal breast cancer patients. This model offers a theoretical foundation for improving clinical predictions and devising tailored treatment strategies for this patient group.
format Article
id doaj-art-8076f0dde62949588e5fbd34fd951461
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-8076f0dde62949588e5fbd34fd9514612025-01-12T12:19:30ZengNature PortfolioScientific Reports2045-23222025-01-0115111410.1038/s41598-024-82373-zMachine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patientsCao Shaoshan0Chen Niannian1Ma Ying2Department of Obstetrics and Gynecology, Mianyang Central Hospital, University of Electronic Science and Technology of ChinaSchool of Information Engineering, Southwest University of Science and TechnologyDepartment of Obstetrics and Gynecology, Mianyang Central Hospital, University of Electronic Science and Technology of ChinaAbstract Objective Endometrial lesions are a frequent complication following breast cancer, and current diagnostic tools have limitations. This study aims to develop a machine learning-based nomogram model for predicting the early detection of endometrial lesions in patients. The model is designed to assess risk and facilitate individualized treatment strategies for premenopausal breast cancer patients. Method A retrospective study was conducted on 224 patients who underwent diagnostic curettage post-tamoxifen (TAM) therapy between November 2012 and November 2023. These patients exhibited signs of endometrial abnormalities or symptoms such as colporrhagia. Clinical data were collected and analyzed using R software (version 4.3.2) to identify factors influencing the occurrence of endometrial lesions and evaluate their predictive values. Three machine learning methods were employed to develop a risk prediction model, and their performances were compared. The best-performing model was selected to construct a nomogram of endometrial lesions. Internal validation was conducted using the bootstrap method, and the model’s accuracy and fit were assessed using the concordance index (C-index) and calibration curves. Results Independent risk factors for endometrial lesions included ultrasound characteristics, duration of TAM therapy, presence of colporrhagia, and endometrial thickness (P < 0.05). Among the machine learning methods compared, the LASSO regression integrated with a multifactorial logistic regression model demonstrated strong performance, with a concordance index (C-index) of 0.874 and effective calibration (mean absolute error of conformity: 0.014). This model achieved an accuracy of 0.853 and a precision of 0.917, with a training set AUC of 0.874 (95% CI: 0.794–0.831) and a test set AUC of 0.891 (95% CI: 0.777-1.000), closely aligning the predicted risk with the actual observed risk. Conclusion The developed prediction model is effective in evaluating endometrial lesions in premenopausal breast cancer patients. This model offers a theoretical foundation for improving clinical predictions and devising tailored treatment strategies for this patient group.https://doi.org/10.1038/s41598-024-82373-zbreast cancertamoxifenendometrial lesionsnomogramprediction model
spellingShingle Cao Shaoshan
Chen Niannian
Ma Ying
Machine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patients
Scientific Reports
breast cancer
tamoxifen
endometrial lesions
nomogram
prediction model
title Machine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patients
title_full Machine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patients
title_fullStr Machine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patients
title_full_unstemmed Machine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patients
title_short Machine learning Nomogram for Predicting endometrial lesions after tamoxifen therapy in breast Cancer patients
title_sort machine learning nomogram for predicting endometrial lesions after tamoxifen therapy in breast cancer patients
topic breast cancer
tamoxifen
endometrial lesions
nomogram
prediction model
url https://doi.org/10.1038/s41598-024-82373-z
work_keys_str_mv AT caoshaoshan machinelearningnomogramforpredictingendometriallesionsaftertamoxifentherapyinbreastcancerpatients
AT chenniannian machinelearningnomogramforpredictingendometriallesionsaftertamoxifentherapyinbreastcancerpatients
AT maying machinelearningnomogramforpredictingendometriallesionsaftertamoxifentherapyinbreastcancerpatients