Enhanced ResNet50 for Diabetic Retinopathy Classification: External Attention and Modified Residual Branch

One of the common microvascular complications in diabetic patients is diabetic retinopathy (DR), which primarily impacts the retinal blood vessels. As the course of diabetes progresses, the incidence of DR gradually increases, and, in serious situations, it can cause vision loss and even blindness....

Full description

Saved in:
Bibliographic Details
Main Authors: Menglong Feng, Yixuan Cai, Shen Yan
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/13/10/1557
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:One of the common microvascular complications in diabetic patients is diabetic retinopathy (DR), which primarily impacts the retinal blood vessels. As the course of diabetes progresses, the incidence of DR gradually increases, and, in serious situations, it can cause vision loss and even blindness. Diagnosing DR early is essential to mitigate its consequences, and deep learning models provide an effective approach. In this study, we propose an improved ResNet50 model, which replaces the 3 × 3 convolution in the residual structure by introducing an external attention mechanism, which improves the model’s awareness of global information and allows the model to grasp the characteristics of the input data more thoroughly. In addition, multiscale convolution is added to the residual branch, which further improves the ability of the model to extract local features and global features, and improves the processing accuracy of image details. In addition, the Sophia optimizer is introduced to replace the traditional Adam optimizer, which further optimizes the classification performance of the model. In this study, 3662 images from the Kaggle open dataset were used to generate 20,184 images for model training after image preprocessing and data augmentation. Experimental results show that the improved ResNet50 model achieves a classification accuracy of 96.68% on the validation set, which is 4.36% higher than the original architecture, and the Kappa value is increased by 5.45%. These improvements contribute to the early diagnosis of DR and decrease the likelihood of blindness among patients.
ISSN:2227-7390