Robust multiclass classification of crop leaf diseases using hybrid deep learning and Grad-CAM interpretability

Abstract The key objective of this study is to propose an effective and accurate deep learning (DL) framework to detect and classify diseases in banana, cherry, and tomato leaves. The performance of multiple pre-trained models is compared against a newly presented model.The experiments used a public...

Full description

Saved in:
Bibliographic Details
Main Authors: Sankar Murugesan, Jayaprakash Chinnadurai, Saravanan Srinivasan, Sandeep Kumar Mathivanan, Radha Raman Chandan, Usha Moorthy
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-14847-7
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract The key objective of this study is to propose an effective and accurate deep learning (DL) framework to detect and classify diseases in banana, cherry, and tomato leaves. The performance of multiple pre-trained models is compared against a newly presented model.The experiments used a publicly released dataset of healthy and unhealthy leaves from banana, cherry, and tomato plants. This dataset was uniformly split into training, validation, and test sets to obtain consistent and unbiased model evaluations. The data pre-processing also involved pre-processing steps suitable for DL architectures to keep the input the same among all the models.We use several state-of-the-art pre-trained ConvNets models for the baselines, such as EfficientNetV2, ConvNeXt, Swin Transformer, and Vi-Transformer (ViT), to have an outlook on the performance. A new ConvNet-ViT hybrid model combines the ConvNet and ViT layers for local feature extraction and maintaining the global context. The classifier’s performance was reinforced by a 5-fold cross-validation mechanism to avoid overfitting.The proposed Hybrid ConvNet-ViT model outperformed all the compared models evaluated, achieving a testing classification accuracy of 99.29%, which outperforms all the pre-trained models. This finding shows that combining ConvNets’ local feature learning with the capability of global representation of the ViT is effective.The result shows that the Hybrid ConvNet-ViT model is an effective and accurate solution in detecting and classifying plant leaf diseases. Its outstanding performance of the state-of-the-art pre-trained top models positions itself as a solid model for practical agricultural use. Fusing the ConvNet and transformer frameworks jointly is beneficial for improving classification performance in image-based disease detection work.
ISSN:2045-2322