GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data

Objective: Diabetes patients are closely related to blood glucose levels. Predicting blood glucose levels through routine blood test data can provide auxiliary diagnosis for diabetes risk prediction in the medical field. However, physical examination datasets are often accompanied by problems such a...

Full description

Saved in:

Bibliographic Details
Main Authors:	Lueshi Li, Huafeng Zhang, Ruizhuo Song
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Blood glucose estimate feature engineering machine learning stacking data augmentation
Online Access:	https://ieeexplore.ieee.org/document/10670390/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846128575366823936
author	Lueshi Li Huafeng Zhang Ruizhuo Song
author_facet	Lueshi Li Huafeng Zhang Ruizhuo Song
author_sort	Lueshi Li
collection	DOAJ
description	Objective: Diabetes patients are closely related to blood glucose levels. Predicting blood glucose levels through routine blood test data can provide auxiliary diagnosis for diabetes risk prediction in the medical field. However, physical examination datasets are often accompanied by problems such as high feature dimensions and uneven blood glucose distribution, which significantly affect the effect of machine learning models. Methods: This paper proposes a GA-KDE-GAN stacking model combined with feature engineering technology, referred to as the GKN framework. GKN integrates genetic algorithm and random forest (GA-RF) for feature selection, kernel density estimation (KDE) for data smoothing and small sample oversampling, and generative adversarial network (GAN) for expanding the training set. The framework uses GA-RF to select feature subsets and obtain the global optimal solution based on LightGBM evaluation, and applies KDE and GAN to balance the dataset. The final model adopts a stacking strategy to enhance the accuracy of blood glucose prediction. Results: By combining GKN feature engineering, the proposed model showed significant performance improvement. Under the challenging data high dimensionality and complexity, the model achieved a mean square error (MSE) of 1.529 and the highest R-square. More importantly, it significantly improved the accuracy of diabetes classification, with accuracy (Acc) and precision (Pre) exceeding 97%. Conclusion: This study addressed the problem of high feature dimension and uneven sample distribution in the physical examination dataset. The GKN framework proved to be effective in improving the prediction performance by integrating GA-RF, KDE and GAN. These findings are promising for glucose-assisted diagnosis of diabetes, as they can predict blood glucose levels based on routine blood test data and help in diabetes risk assessment.
format	Article
id	doaj-art-e6b4955ac9c342fda9fc42f604b9abc4
institution	Kabale University
issn	2169-3536
language	English
publishDate	2024-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-e6b4955ac9c342fda9fc42f604b9abc42024-12-11T00:04:52ZengIEEEIEEE Access2169-35362024-01-011217808917810310.1109/ACCESS.2024.345690810670390GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination DataLueshi Li0Huafeng Zhang1Ruizhuo Song2https://orcid.org/0000-0002-6693-2738Northeast Electric Power University, Jilin, ChinaBeijing Engineering Research Center of Industrial Spectrum Imaging, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, ChinaBeijing Engineering Research Center of Industrial Spectrum Imaging, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, ChinaObjective: Diabetes patients are closely related to blood glucose levels. Predicting blood glucose levels through routine blood test data can provide auxiliary diagnosis for diabetes risk prediction in the medical field. However, physical examination datasets are often accompanied by problems such as high feature dimensions and uneven blood glucose distribution, which significantly affect the effect of machine learning models. Methods: This paper proposes a GA-KDE-GAN stacking model combined with feature engineering technology, referred to as the GKN framework. GKN integrates genetic algorithm and random forest (GA-RF) for feature selection, kernel density estimation (KDE) for data smoothing and small sample oversampling, and generative adversarial network (GAN) for expanding the training set. The framework uses GA-RF to select feature subsets and obtain the global optimal solution based on LightGBM evaluation, and applies KDE and GAN to balance the dataset. The final model adopts a stacking strategy to enhance the accuracy of blood glucose prediction. Results: By combining GKN feature engineering, the proposed model showed significant performance improvement. Under the challenging data high dimensionality and complexity, the model achieved a mean square error (MSE) of 1.529 and the highest R-square. More importantly, it significantly improved the accuracy of diabetes classification, with accuracy (Acc) and precision (Pre) exceeding 97%. Conclusion: This study addressed the problem of high feature dimension and uneven sample distribution in the physical examination dataset. The GKN framework proved to be effective in improving the prediction performance by integrating GA-RF, KDE and GAN. These findings are promising for glucose-assisted diagnosis of diabetes, as they can predict blood glucose levels based on routine blood test data and help in diabetes risk assessment.https://ieeexplore.ieee.org/document/10670390/Blood glucose estimatefeature engineeringmachine learningstackingdata augmentation
spellingShingle	Lueshi Li Huafeng Zhang Ruizhuo Song GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data IEEE Access Blood glucose estimate feature engineering machine learning stacking data augmentation
title	GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_full	GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_fullStr	GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_full_unstemmed	GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_short	GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data
title_sort	gkn stack an ensemble deep learning framework for blood glucose forecasting based on medical examination data
topic	Blood glucose estimate feature engineering machine learning stacking data augmentation
url	https://ieeexplore.ieee.org/document/10670390/
work_keys_str_mv	AT lueshili gknstackanensembledeeplearningframeworkforbloodglucoseforecastingbasedonmedicalexaminationdata AT huafengzhang gknstackanensembledeeplearningframeworkforbloodglucoseforecastingbasedonmedicalexaminationdata AT ruizhuosong gknstackanensembledeeplearningframeworkforbloodglucoseforecastingbasedonmedicalexaminationdata

GKN-Stack: An Ensemble Deep Learning Framework for Blood Glucose Forecasting Based on Medical Examination Data

Similar Items