Identification of Leaf Rust-Related Gene Signature in Wheat (Triticum Aestivum L.) Using High-Throughput Sequencing, Network Analysis, and Machine Learning Algorithms
Abstract Wheat provides staple food and industrial raw material for humans and animals, but its production decreased due to leaf rust (Lr) disease caused by Puccinia triticina by up to 15%. It is challenging to identify Lr-associated genes due to the limited sample size and large genome, which hinde...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
SpringerOpen
2025-08-01
|
| Series: | Rice |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s12284-025-00839-8 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Abstract Wheat provides staple food and industrial raw material for humans and animals, but its production decreased due to leaf rust (Lr) disease caused by Puccinia triticina by up to 15%. It is challenging to identify Lr-associated genes due to the limited sample size and large genome, which hinders the breeding efforts for Lr disease. This study integrated RNA-seq data to mine the candidate genes using meta-analysis, WGCNA, and machine-learning approaches. As a result, 2153 upregulated and 1579 downregulated meta-differentially expressed genes (meta-DEGs) were identified, with four known genes (Lr13 and Lr67/Yr46/Sr55). The meta-DEGs were significantly enriched in antifungal innate immune response, glutathione metabolism, detoxification, phenylalanine, and flavonoid biosynthesis. Among these, 124 resistance (R) genes (~ 85.48% upregulated) were expressed differentially, and ~ 80% belonged to plant pattern recognition receptors (PPRs) that triggered immunity. Likewise, 162 transcription factors (TFs), including WRKY (43), ERF (30), and MYB (33), were associated with Lr disease, and 81 candidate hub genes were co-expressed for Lr. Finally, nine potential candidate genes, including TraesCS7A03G0388400 (BSP), TraesCS1A03G0869900 (PR4), TraesCS6B03G1228800 (AP2/ERF), TraesCS3B03G0088700 (MYB62), TraesCS5A03G1198800 (CYP96A10), and TraesCSU03G0129300 (LTP4), were mined via attribute weighting and machine learning model (XGBoost AUC = 0.97 and accuracy = 0.90) and validated via single-gene model, linear regression, and t-test at p ≤ 0.05. The relative expressions calculated via RT-qPCR assay of nine genes were significantly higher at different time points under Lr infection. Thus, this study reported genes under Lr infection using advanced bioinformatics and supervised machine-learning models, which provide fundamental insight and a solid foundation for understanding the molecular mechanisms of Lr resistance and offer an advanced pipeline for future breeding programs to develop superior cultivars with durable resistance. |
|---|---|
| ISSN: | 1939-8425 1939-8433 |