Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter study

Abstract Primary liver cancer is the sixth most commonly diagnosed cancer globally and the third leading cause of cancer-related deaths. Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer, and microvascular invasion (MVI) is a significant risk factor affecting postoperati...

Full description

Saved in:
Bibliographic Details
Main Authors: Zongren Ding, Jianxing Zeng, Guoxu Fang, Pengfei Guo, Weiping Zhou, Yongyi Zeng
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-08502-4
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849344566978674688
author Zongren Ding
Jianxing Zeng
Guoxu Fang
Pengfei Guo
Weiping Zhou
Yongyi Zeng
author_facet Zongren Ding
Jianxing Zeng
Guoxu Fang
Pengfei Guo
Weiping Zhou
Yongyi Zeng
author_sort Zongren Ding
collection DOAJ
description Abstract Primary liver cancer is the sixth most commonly diagnosed cancer globally and the third leading cause of cancer-related deaths. Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer, and microvascular invasion (MVI) is a significant risk factor affecting postoperative prognosis in HCC. However, accurately predicting MVI preoperatively remains a challenge. This study aims to evaluate the application of large language models (LLMs), specifically ChatGPT 4o, in predicting MVI in HCC and to compare its performance with traditional clinical models. In this retrospective study, 300 HCC patients who underwent curative liver resection between June 2018 and December 2018 were selected at two centers. The collected clinical data included age, gender, HBV infection, liver cirrhosis, AFP levels, and more. ChatGPT 4o were used to process the clinical data of the patients and predict MVI. Subsequently, the predictive results of the ChatGPT 4o were compared with machine learning models, the ROC curves were plotted, and AUC was calculated. The results showed that the AUC of the ChatGPT 4o was 0.755. Machine learning algorithms use Random Forest, Support Vector Machine, Logistic Regression, XGBoost and Decision Tree, the AUC of 5 machine learning algorithms was range from 0.534 to 0.624. ChatGPT 4o achieved the highest AUC and showed statistically significant differences compared to Support Vector Machine, Logistic Regression and Decision Tree. Additionally, the predictive results of the ChatGPT 4o effectively stratified the postoperative overall survival (OS) and recurrence-free survival (RFS) of HCC patients. LLMs have demonstrated significant predictive capabilities for MVI in HCC and for risk stratification regarding postoperative OS and RFS. These advancements possess substantial potential to enhance preoperative management and make surgical planning.
format Article
id doaj-art-4794ee9d72e741e6afca3f1c2cb0551a
institution Kabale University
issn 2045-2322
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-4794ee9d72e741e6afca3f1c2cb0551a2025-08-20T03:42:38ZengNature PortfolioScientific Reports2045-23222025-07-011511710.1038/s41598-025-08502-4Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter studyZongren Ding0Jianxing Zeng1Guoxu Fang2Pengfei Guo3Weiping Zhou4Yongyi Zeng5Department of Hepatopancreatobiliary Surgery, Mengchao Hepatobiliary Hospital of Fujian Medical UniversityDepartment of Hepatopancreatobiliary Surgery, Mengchao Hepatobiliary Hospital of Fujian Medical UniversityDepartment of Hepatopancreatobiliary Surgery, Mengchao Hepatobiliary Hospital of Fujian Medical UniversityThe Big Data Institute of Southeast Hepatobiliary Health Information, Mengchao Hepatobiliary Hospital of Fujian Medical UniversityThe Third Department of Hepatic Surgery, Eastern Hepatobiliary Surgery Hospital, Second Military Medical UniversityDepartment of Hepatopancreatobiliary Surgery, Mengchao Hepatobiliary Hospital of Fujian Medical UniversityAbstract Primary liver cancer is the sixth most commonly diagnosed cancer globally and the third leading cause of cancer-related deaths. Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer, and microvascular invasion (MVI) is a significant risk factor affecting postoperative prognosis in HCC. However, accurately predicting MVI preoperatively remains a challenge. This study aims to evaluate the application of large language models (LLMs), specifically ChatGPT 4o, in predicting MVI in HCC and to compare its performance with traditional clinical models. In this retrospective study, 300 HCC patients who underwent curative liver resection between June 2018 and December 2018 were selected at two centers. The collected clinical data included age, gender, HBV infection, liver cirrhosis, AFP levels, and more. ChatGPT 4o were used to process the clinical data of the patients and predict MVI. Subsequently, the predictive results of the ChatGPT 4o were compared with machine learning models, the ROC curves were plotted, and AUC was calculated. The results showed that the AUC of the ChatGPT 4o was 0.755. Machine learning algorithms use Random Forest, Support Vector Machine, Logistic Regression, XGBoost and Decision Tree, the AUC of 5 machine learning algorithms was range from 0.534 to 0.624. ChatGPT 4o achieved the highest AUC and showed statistically significant differences compared to Support Vector Machine, Logistic Regression and Decision Tree. Additionally, the predictive results of the ChatGPT 4o effectively stratified the postoperative overall survival (OS) and recurrence-free survival (RFS) of HCC patients. LLMs have demonstrated significant predictive capabilities for MVI in HCC and for risk stratification regarding postoperative OS and RFS. These advancements possess substantial potential to enhance preoperative management and make surgical planning.https://doi.org/10.1038/s41598-025-08502-4Hepatocellular carcinomaLarge language modelsMicrovascular invasion
spellingShingle Zongren Ding
Jianxing Zeng
Guoxu Fang
Pengfei Guo
Weiping Zhou
Yongyi Zeng
Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter study
Scientific Reports
Hepatocellular carcinoma
Large language models
Microvascular invasion
title Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter study
title_full Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter study
title_fullStr Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter study
title_full_unstemmed Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter study
title_short Evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in HCC: a multicenter study
title_sort evaluating the efficacy of using large language models in preoperative prediction of microvascular invasion in hcc a multicenter study
topic Hepatocellular carcinoma
Large language models
Microvascular invasion
url https://doi.org/10.1038/s41598-025-08502-4
work_keys_str_mv AT zongrending evaluatingtheefficacyofusinglargelanguagemodelsinpreoperativepredictionofmicrovascularinvasioninhccamulticenterstudy
AT jianxingzeng evaluatingtheefficacyofusinglargelanguagemodelsinpreoperativepredictionofmicrovascularinvasioninhccamulticenterstudy
AT guoxufang evaluatingtheefficacyofusinglargelanguagemodelsinpreoperativepredictionofmicrovascularinvasioninhccamulticenterstudy
AT pengfeiguo evaluatingtheefficacyofusinglargelanguagemodelsinpreoperativepredictionofmicrovascularinvasioninhccamulticenterstudy
AT weipingzhou evaluatingtheefficacyofusinglargelanguagemodelsinpreoperativepredictionofmicrovascularinvasioninhccamulticenterstudy
AT yongyizeng evaluatingtheefficacyofusinglargelanguagemodelsinpreoperativepredictionofmicrovascularinvasioninhccamulticenterstudy