Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced Data

Breast cancer is a global health concern where early and accurate diagnosis is crucial. Mammogram scans provide detailed imaging but require expert interpretation, which is time-consuming. While deep learning shows promise in medical image analysis, the prevalence of imbalanced datasets in medical d...

Full description

Saved in:
Bibliographic Details
Main Authors: Abdullah Fahad A. Alshamrani, Faisal Saleh Zuhair Alshomrani
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10819401/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841550743520149504
author Abdullah Fahad A. Alshamrani
Faisal Saleh Zuhair Alshomrani
author_facet Abdullah Fahad A. Alshamrani
Faisal Saleh Zuhair Alshomrani
author_sort Abdullah Fahad A. Alshamrani
collection DOAJ
description Breast cancer is a global health concern where early and accurate diagnosis is crucial. Mammogram scans provide detailed imaging but require expert interpretation, which is time-consuming. While deep learning shows promise in medical image analysis, the prevalence of imbalanced datasets in medical diagnosis hinders the development of accurate and reliable classification models. We propose a novel deep-learning framework for breast cancer classification from Mammogram scans. The framework addresses imbalanced data through a unique two-module pipeline incorporating the Synthetic Minority Over-sampling Technique (SMOTE). One module employs SMOTE on the entire dataset to balance class distribution. At the same time, the second separates a portion (20%) of the original imbalanced data for evaluation and applies SMOTE to the remaining 80%. The framework incorporates a blockwise Convolutional Neural Network (CNN), utilizing VGG16 preprocessing for input standardization and ResNet50 for feature extraction. A fully connected classification model, consisting of multiple dense layers with batch normalization and dropout for regularization, was developed to assess the extracted features. The model architecture was iteratively refined to combat overfitting, with the final version incorporating three dense layers (128, 256, and 128 neurons) with dropout rates of 0.5. Our model achieved 99% accuracy on a balanced dataset and 90% on an imbalanced portion. The framework includes an interpretable visualization technique for randomly selected predictions across all classes. Our approach significantly improves diagnostic accuracy in breast cancer classification from Mammogram scans, effectively addressing the challenge of imbalanced data in medical image analysis. This work contributes to medical image analysis and computer-aided diagnosis. The proposed techniques for handling imbalanced data and providing interpretable results can be extended to improve diagnostic accuracy across various medical conditions.
format Article
id doaj-art-1e783fb19fa24c40916a3a3399d3da95
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-1e783fb19fa24c40916a3a3399d3da952025-01-10T00:01:23ZengIEEEIEEE Access2169-35362025-01-01134815482610.1109/ACCESS.2024.352463310819401Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced DataAbdullah Fahad A. Alshamrani0https://orcid.org/0000-0001-5039-8094Faisal Saleh Zuhair Alshomrani1https://orcid.org/0009-0002-1207-2223Department of Diagnostic Radiology Technology, College of Applied Medical Science, Taibah University, Madinah, Saudi ArabiaDepartment of Diagnostic Radiology Technology, College of Applied Medical Science, Taibah University, Madinah, Saudi ArabiaBreast cancer is a global health concern where early and accurate diagnosis is crucial. Mammogram scans provide detailed imaging but require expert interpretation, which is time-consuming. While deep learning shows promise in medical image analysis, the prevalence of imbalanced datasets in medical diagnosis hinders the development of accurate and reliable classification models. We propose a novel deep-learning framework for breast cancer classification from Mammogram scans. The framework addresses imbalanced data through a unique two-module pipeline incorporating the Synthetic Minority Over-sampling Technique (SMOTE). One module employs SMOTE on the entire dataset to balance class distribution. At the same time, the second separates a portion (20%) of the original imbalanced data for evaluation and applies SMOTE to the remaining 80%. The framework incorporates a blockwise Convolutional Neural Network (CNN), utilizing VGG16 preprocessing for input standardization and ResNet50 for feature extraction. A fully connected classification model, consisting of multiple dense layers with batch normalization and dropout for regularization, was developed to assess the extracted features. The model architecture was iteratively refined to combat overfitting, with the final version incorporating three dense layers (128, 256, and 128 neurons) with dropout rates of 0.5. Our model achieved 99% accuracy on a balanced dataset and 90% on an imbalanced portion. The framework includes an interpretable visualization technique for randomly selected predictions across all classes. Our approach significantly improves diagnostic accuracy in breast cancer classification from Mammogram scans, effectively addressing the challenge of imbalanced data in medical image analysis. This work contributes to medical image analysis and computer-aided diagnosis. The proposed techniques for handling imbalanced data and providing interpretable results can be extended to improve diagnostic accuracy across various medical conditions.https://ieeexplore.ieee.org/document/10819401/Breast cancerBI-RADsclassificationdeep learningfeatures extractionsimbalance data
spellingShingle Abdullah Fahad A. Alshamrani
Faisal Saleh Zuhair Alshomrani
Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced Data
IEEE Access
Breast cancer
BI-RADs
classification
deep learning
features extractions
imbalance data
title Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced Data
title_full Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced Data
title_fullStr Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced Data
title_full_unstemmed Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced Data
title_short Optimizing Breast Cancer Mammogram Classification Through a Dual Approach: A Deep Learning Framework Combining ResNet50, SMOTE, and Fully Connected Layers for Balanced and Imbalanced Data
title_sort optimizing breast cancer mammogram classification through a dual approach a deep learning framework combining resnet50 smote and fully connected layers for balanced and imbalanced data
topic Breast cancer
BI-RADs
classification
deep learning
features extractions
imbalance data
url https://ieeexplore.ieee.org/document/10819401/
work_keys_str_mv AT abdullahfahadaalshamrani optimizingbreastcancermammogramclassificationthroughadualapproachadeeplearningframeworkcombiningresnet50smoteandfullyconnectedlayersforbalancedandimbalanceddata
AT faisalsalehzuhairalshomrani optimizingbreastcancermammogramclassificationthroughadualapproachadeeplearningframeworkcombiningresnet50smoteandfullyconnectedlayersforbalancedandimbalanceddata