Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets
IntroductionDiabetes prediction using clinical datasets is crucial for medical data analysis. However, class imbalances, where non-diabetic cases dominate, can significantly affect machine learning model performance, leading to biased predictions and reduced generalization.MethodsA novel predictive...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-01-01
|
Series: | Frontiers in Artificial Intelligence |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/frai.2024.1499530/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841556755907084288 |
---|---|
author | Inam Abousaber Haitham F. Abdallah Hany El-Ghaish |
author_facet | Inam Abousaber Haitham F. Abdallah Hany El-Ghaish |
author_sort | Inam Abousaber |
collection | DOAJ |
description | IntroductionDiabetes prediction using clinical datasets is crucial for medical data analysis. However, class imbalances, where non-diabetic cases dominate, can significantly affect machine learning model performance, leading to biased predictions and reduced generalization.MethodsA novel predictive framework employing cutting-edge machine learning algorithms and advanced imbalance handling techniques was developed. The framework integrates feature engineering and resampling strategies to enhance predictive accuracy.ResultsRigorous testing was conducted on three datasets—PIMA, Diabetes Dataset 2019, and BIT_2019—demonstrating the robustness and adaptability of the methodology across varying data environments.DiscussionThe experimental results highlight the critical role of model selection and imbalance mitigation in achieving reliable and generalizable diabetes predictions. This study offers significant contributions to medical informatics by proposing a robust data-driven framework that addresses class imbalance challenges, thereby advancing diabetes prediction accuracy. |
format | Article |
id | doaj-art-e63692ce8ecd469e9a8b4703fe27a80c |
institution | Kabale University |
issn | 2624-8212 |
language | English |
publishDate | 2025-01-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Artificial Intelligence |
spelling | doaj-art-e63692ce8ecd469e9a8b4703fe27a80c2025-01-07T06:42:25ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122025-01-01710.3389/frai.2024.14995301499530Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasetsInam Abousaber0Haitham F. Abdallah1Hany El-Ghaish2Department of Information Technology, Faculty of Computers and Information Technology, University of Tabuk, Tabuk, Saudi ArabiaDepartment of Electronics and Electrical Communication, Higher Institute of Engineering and Technology, Kafr El Sheikh, EgyptDepartment of Computer and Automatic Control, Faculty of Engineering, Tanta University, Tanta, EgyptIntroductionDiabetes prediction using clinical datasets is crucial for medical data analysis. However, class imbalances, where non-diabetic cases dominate, can significantly affect machine learning model performance, leading to biased predictions and reduced generalization.MethodsA novel predictive framework employing cutting-edge machine learning algorithms and advanced imbalance handling techniques was developed. The framework integrates feature engineering and resampling strategies to enhance predictive accuracy.ResultsRigorous testing was conducted on three datasets—PIMA, Diabetes Dataset 2019, and BIT_2019—demonstrating the robustness and adaptability of the methodology across varying data environments.DiscussionThe experimental results highlight the critical role of model selection and imbalance mitigation in achieving reliable and generalizable diabetes predictions. This study offers significant contributions to medical informatics by proposing a robust data-driven framework that addresses class imbalance challenges, thereby advancing diabetes prediction accuracy.https://www.frontiersin.org/articles/10.3389/frai.2024.1499530/fulldiabetes detectionimbalance handling methodsimbalanced datasetsmachine learningstatistical analysis |
spellingShingle | Inam Abousaber Haitham F. Abdallah Hany El-Ghaish Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets Frontiers in Artificial Intelligence diabetes detection imbalance handling methods imbalanced datasets machine learning statistical analysis |
title | Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets |
title_full | Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets |
title_fullStr | Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets |
title_full_unstemmed | Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets |
title_short | Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets |
title_sort | robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets |
topic | diabetes detection imbalance handling methods imbalanced datasets machine learning statistical analysis |
url | https://www.frontiersin.org/articles/10.3389/frai.2024.1499530/full |
work_keys_str_mv | AT inamabousaber robustpredictiveframeworkfordiabetesclassificationusingoptimizedmachinelearningonimbalanceddatasets AT haithamfabdallah robustpredictiveframeworkfordiabetesclassificationusingoptimizedmachinelearningonimbalanceddatasets AT hanyelghaish robustpredictiveframeworkfordiabetesclassificationusingoptimizedmachinelearningonimbalanceddatasets |