SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma

Abstract Non-small cell lung adenocarcinoma (LUAD) is a markedly heterogeneous disease, with its underlying molecular mechanisms and prognosis prediction presenting ongoing challenges. In this study, we integrated data from multiple public datasets, including TCGA, GSE31210, and GSE13213, encompassi...

Full description

Saved in:
Bibliographic Details
Main Authors: Baozhen Wang, Yichen Yin, Anqi Wang, Weidi Liu, Jing Chen, Tao Li
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-85471-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544807012368384
author Baozhen Wang
Yichen Yin
Anqi Wang
Weidi Liu
Jing Chen
Tao Li
author_facet Baozhen Wang
Yichen Yin
Anqi Wang
Weidi Liu
Jing Chen
Tao Li
author_sort Baozhen Wang
collection DOAJ
description Abstract Non-small cell lung adenocarcinoma (LUAD) is a markedly heterogeneous disease, with its underlying molecular mechanisms and prognosis prediction presenting ongoing challenges. In this study, we integrated data from multiple public datasets, including TCGA, GSE31210, and GSE13213, encompassing a total of 867 tumor samples. By employing Mendelian randomization (MR) analysis, machine learning techniques, and comprehensive bioinformatics approaches, we conducted an in-depth investigation into the molecular characteristics, prognostic markers, and potential therapeutic targets of LUAD. Our analysis identified 321 genes significantly associated with LUAD, with CENP-A, MCM7, and DLGAP5 emerging as highly connected nodes in network analyses. By performing correlation analysis and Cox regression analysis, we identified 26 prognostic genes and classified LUAD samples into two molecular subtypes with significantly distinct survival outcomes. The Random Survival Forest (RSF) model exhibited robust prognostic predictive capabilities across multiple independent cohorts (AUC > 0.75). Beyond merely predicting patient outcomes, this model also captures key features of the tumor immune microenvironment and potential therapeutic responses. Functional enrichment analysis revealed the complex interplay of cell cycle regulation, DNA repair, immune response, and metabolic reprogramming in the progression of LUAD. Furthermore, we observed a strong correlation between risk scores and the expression of specific cytokines, such as CCL17, CCR2, and CCL20, suggesting novel avenues for developing cytokine network-based therapeutic strategies. This study offers fresh insights into the molecular subtyping, prognostic prediction, and personalized therapeutic decision-making in LUAD, laying a critical foundation for future clinical applications and targeted therapy research.
format Article
id doaj-art-317a7ea9b7c243efaf4fdee016ff56f2
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-317a7ea9b7c243efaf4fdee016ff56f22025-01-12T12:16:01ZengNature PortfolioScientific Reports2045-23222025-01-0115111810.1038/s41598-025-85471-8SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinomaBaozhen Wang0Yichen Yin1Anqi Wang2Weidi Liu3Jing Chen4Tao Li5School of Clinical Medicine, Ningxia Medical UniversitySchool of Clinical Medicine, Ningxia Medical UniversityKey Laboratory of Fertility Preservation and Maintenance (Ningxia Medical University), Ministry of EducationSchool of Clinical Medicine, Ningxia Medical UniversityKey Laboratory of Fertility Preservation and Maintenance (Ningxia Medical University), Ministry of EducationDepartment of Surgical Oncology II, The General Hospital of Ningxia Medical UniversityAbstract Non-small cell lung adenocarcinoma (LUAD) is a markedly heterogeneous disease, with its underlying molecular mechanisms and prognosis prediction presenting ongoing challenges. In this study, we integrated data from multiple public datasets, including TCGA, GSE31210, and GSE13213, encompassing a total of 867 tumor samples. By employing Mendelian randomization (MR) analysis, machine learning techniques, and comprehensive bioinformatics approaches, we conducted an in-depth investigation into the molecular characteristics, prognostic markers, and potential therapeutic targets of LUAD. Our analysis identified 321 genes significantly associated with LUAD, with CENP-A, MCM7, and DLGAP5 emerging as highly connected nodes in network analyses. By performing correlation analysis and Cox regression analysis, we identified 26 prognostic genes and classified LUAD samples into two molecular subtypes with significantly distinct survival outcomes. The Random Survival Forest (RSF) model exhibited robust prognostic predictive capabilities across multiple independent cohorts (AUC > 0.75). Beyond merely predicting patient outcomes, this model also captures key features of the tumor immune microenvironment and potential therapeutic responses. Functional enrichment analysis revealed the complex interplay of cell cycle regulation, DNA repair, immune response, and metabolic reprogramming in the progression of LUAD. Furthermore, we observed a strong correlation between risk scores and the expression of specific cytokines, such as CCL17, CCR2, and CCL20, suggesting novel avenues for developing cytokine network-based therapeutic strategies. This study offers fresh insights into the molecular subtyping, prognostic prediction, and personalized therapeutic decision-making in LUAD, laying a critical foundation for future clinical applications and targeted therapy research.https://doi.org/10.1038/s41598-025-85471-8Non-small cell lung adenocarcinoma, LUADMendelian randomizationMolecular subtypesMachine learning prognostic modelMulti-omics integrative analysis
spellingShingle Baozhen Wang
Yichen Yin
Anqi Wang
Weidi Liu
Jing Chen
Tao Li
SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma
Scientific Reports
Non-small cell lung adenocarcinoma, LUAD
Mendelian randomization
Molecular subtypes
Machine learning prognostic model
Multi-omics integrative analysis
title SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma
title_full SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma
title_fullStr SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma
title_full_unstemmed SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma
title_short SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma
title_sort smr guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non small cell lung adenocarcinoma
topic Non-small cell lung adenocarcinoma, LUAD
Mendelian randomization
Molecular subtypes
Machine learning prognostic model
Multi-omics integrative analysis
url https://doi.org/10.1038/s41598-025-85471-8
work_keys_str_mv AT baozhenwang smrguidedmolecularsubtypingandmachinelearningmodelrevealsnovelprognosticbiomarkersandtherapeutictargetsinnonsmallcelllungadenocarcinoma
AT yichenyin smrguidedmolecularsubtypingandmachinelearningmodelrevealsnovelprognosticbiomarkersandtherapeutictargetsinnonsmallcelllungadenocarcinoma
AT anqiwang smrguidedmolecularsubtypingandmachinelearningmodelrevealsnovelprognosticbiomarkersandtherapeutictargetsinnonsmallcelllungadenocarcinoma
AT weidiliu smrguidedmolecularsubtypingandmachinelearningmodelrevealsnovelprognosticbiomarkersandtherapeutictargetsinnonsmallcelllungadenocarcinoma
AT jingchen smrguidedmolecularsubtypingandmachinelearningmodelrevealsnovelprognosticbiomarkersandtherapeutictargetsinnonsmallcelllungadenocarcinoma
AT taoli smrguidedmolecularsubtypingandmachinelearningmodelrevealsnovelprognosticbiomarkersandtherapeutictargetsinnonsmallcelllungadenocarcinoma