GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data

Objective: Diabetes patients are closely related to blood glucose levels. Predicting blood glucose levels through routine blood test data can provide auxiliary diagnosis for diabetes risk prediction in the medical field. However, physical examination datasets are often accompanied by problems such a...

Full description

Saved in:
Bibliographic Details
Main Authors: Lueshi Li, Huafeng Zhang, Ruizhuo Song
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10670390/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846128575366823936
author Lueshi Li
Huafeng Zhang
Ruizhuo Song
author_facet Lueshi Li
Huafeng Zhang
Ruizhuo Song
author_sort Lueshi Li
collection DOAJ
description Objective: Diabetes patients are closely related to blood glucose levels. Predicting blood glucose levels through routine blood test data can provide auxiliary diagnosis for diabetes risk prediction in the medical field. However, physical examination datasets are often accompanied by problems such as high feature dimensions and uneven blood glucose distribution, which significantly affect the effect of machine learning models. Methods: This paper proposes a GA-KDE-GAN stacking model combined with feature engineering technology, referred to as the GKN framework. GKN integrates genetic algorithm and random forest (GA-RF) for feature selection, kernel density estimation (KDE) for data smoothing and small sample oversampling, and generative adversarial network (GAN) for expanding the training set. The framework uses GA-RF to select feature subsets and obtain the global optimal solution based on LightGBM evaluation, and applies KDE and GAN to balance the dataset. The final model adopts a stacking strategy to enhance the accuracy of blood glucose prediction. Results: By combining GKN feature engineering, the proposed model showed significant performance improvement. Under the challenging data high dimensionality and complexity, the model achieved a mean square error (MSE) of 1.529 and the highest R-square. More importantly, it significantly improved the accuracy of diabetes classification, with accuracy (Acc) and precision (Pre) exceeding 97%. Conclusion: This study addressed the problem of high feature dimension and uneven sample distribution in the physical examination dataset. The GKN framework proved to be effective in improving the prediction performance by integrating GA-RF, KDE and GAN. These findings are promising for glucose-assisted diagnosis of diabetes, as they can predict blood glucose levels based on routine blood test data and help in diabetes risk assessment.
format Article
id doaj-art-e6b4955ac9c342fda9fc42f604b9abc4
institution Kabale University
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-e6b4955ac9c342fda9fc42f604b9abc42024-12-11T00:04:52ZengIEEEIEEE Access2169-35362024-01-011217808917810310.1109/ACCESS.2024.345690810670390GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination DataLueshi Li0Huafeng Zhang1Ruizhuo Song2https://orcid.org/0000-0002-6693-2738Northeast Electric Power University, Jilin, ChinaBeijing Engineering Research Center of Industrial Spectrum Imaging, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, ChinaBeijing Engineering Research Center of Industrial Spectrum Imaging, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, ChinaObjective: Diabetes patients are closely related to blood glucose levels. Predicting blood glucose levels through routine blood test data can provide auxiliary diagnosis for diabetes risk prediction in the medical field. However, physical examination datasets are often accompanied by problems such as high feature dimensions and uneven blood glucose distribution, which significantly affect the effect of machine learning models. Methods: This paper proposes a GA-KDE-GAN stacking model combined with feature engineering technology, referred to as the GKN framework. GKN integrates genetic algorithm and random forest (GA-RF) for feature selection, kernel density estimation (KDE) for data smoothing and small sample oversampling, and generative adversarial network (GAN) for expanding the training set. The framework uses GA-RF to select feature subsets and obtain the global optimal solution based on LightGBM evaluation, and applies KDE and GAN to balance the dataset. The final model adopts a stacking strategy to enhance the accuracy of blood glucose prediction. Results: By combining GKN feature engineering, the proposed model showed significant performance improvement. Under the challenging data high dimensionality and complexity, the model achieved a mean square error (MSE) of 1.529 and the highest R-square. More importantly, it significantly improved the accuracy of diabetes classification, with accuracy (Acc) and precision (Pre) exceeding 97%. Conclusion: This study addressed the problem of high feature dimension and uneven sample distribution in the physical examination dataset. The GKN framework proved to be effective in improving the prediction performance by integrating GA-RF, KDE and GAN. These findings are promising for glucose-assisted diagnosis of diabetes, as they can predict blood glucose levels based on routine blood test data and help in diabetes risk assessment.https://ieeexplore.ieee.org/document/10670390/Blood glucose estimatefeature engineeringmachine learningstackingdata augmentation
spellingShingle Lueshi Li
Huafeng Zhang
Ruizhuo Song
GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
IEEE Access
Blood glucose estimate
feature engineering
machine learning
stacking
data augmentation
title GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_full GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_fullStr GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_full_unstemmed GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_short GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_sort gkn stack an ensemble deep learning framework for blood glucose forecasting based on medical examination data
topic Blood glucose estimate
feature engineering
machine learning
stacking
data augmentation
url https://ieeexplore.ieee.org/document/10670390/
work_keys_str_mv AT lueshili gknstackanensembledeeplearningframeworkforbloodglucoseforecastingbasedonmedicalexaminationdata
AT huafengzhang gknstackanensembledeeplearningframeworkforbloodglucoseforecastingbasedonmedicalexaminationdata
AT ruizhuosong gknstackanensembledeeplearningframeworkforbloodglucoseforecastingbasedonmedicalexaminationdata