Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value

Diabetes mellitus is a chronic disease often caused by high blood glucose levels and insufficient insulin production. This research aims to address the classification problem of diabetes mellitus using the K-Nearest Neighbor (K-NN) method. The aim of this research is to create a machine learning mod...

Full description

Saved in:
Bibliographic Details
Main Authors: Putro Sigit Susanto, Putra Moh Abdan Syakura, Fatah Doni Abdul, Asmara Yuli Panca, Fauzan Hermawan Bin, Rochman Eka Mala Sari, Rachmad Aeri
Format: Article
Language:English
Published: EDP Sciences 2024-01-01
Series:BIO Web of Conferences
Online Access:https://www.bio-conferences.org/articles/bioconf/pdf/2024/65/bioconf_btmic2024_01081.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846139482620821504
author Putro Sigit Susanto
Putra Moh Abdan Syakura
Fatah Doni Abdul
Asmara Yuli Panca
Fauzan Hermawan Bin
Rochman Eka Mala Sari
Rachmad Aeri
author_facet Putro Sigit Susanto
Putra Moh Abdan Syakura
Fatah Doni Abdul
Asmara Yuli Panca
Fauzan Hermawan Bin
Rochman Eka Mala Sari
Rachmad Aeri
author_sort Putro Sigit Susanto
collection DOAJ
description Diabetes mellitus is a chronic disease often caused by high blood glucose levels and insufficient insulin production. This research aims to address the classification problem of diabetes mellitus using the K-Nearest Neighbor (K-NN) method. The aim of this research is to create a machine learning model that can detect diabetes early. The study was conducted at Syarifah Ambami Rato Ebu Hospital in Bangkalan, utilizing data from 120 patients in 2019, employing data mining techniques to classify diabetes mellitus patients. Additionally, the steps in data mining involve determining significant variables or features for classification Cleansing and normalization and transformation. The research compares training test results with ratios of 90:10, 80:20, and 70:30. Experimental results show that K-NN with a neighbor value of K=11 achieves the highest accuracy rate of 83% a reduced error rate of 16.67%, and the highest AUC value of 0.7407. These results indicate that the 90:10 data split ratio yields the best model performance in terms of accuracy and class differentiation for diabetes mellitus, as well as the lowest error rate compared to other data split ratios. This study provides a better understanding of diabetes mellitus and demonstrates that K-NN is effective in addressing classification problems, focusing on specific variables that influence the disease. Therefore, it can be concluded that K-Nearest Neighbor (K-NN) is a suitable algorithm for classifying diabetes mellitus.
format Article
id doaj-art-75e80f8a9a124c819aef21e858ddb09d
institution Kabale University
issn 2117-4458
language English
publishDate 2024-01-01
publisher EDP Sciences
record_format Article
series BIO Web of Conferences
spelling doaj-art-75e80f8a9a124c819aef21e858ddb09d2024-12-06T09:33:56ZengEDP SciencesBIO Web of Conferences2117-44582024-01-011460108110.1051/bioconf/202414601081bioconf_btmic2024_01081Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing valuePutro Sigit Susanto0Putra Moh Abdan Syakura1Fatah Doni Abdul2Asmara Yuli Panca3Fauzan Hermawan Bin4Rochman Eka Mala Sari5Rachmad Aeri6Departemen of Informatics, Faculty of Engineering, University of Trunojoyo MaduraDepartemen of Informatics, Faculty of Engineering, University of Trunojoyo MaduraDepartemen of Informatics, Faculty of Engineering, University of Trunojoyo MaduraFaculty of Engineering and Quantity Surveying, INTI International UniversityDepartemen of Informatics, Faculty of Engineering, University of Trunojoyo MaduraDepartemen of Informatics, Faculty of Engineering, University of Trunojoyo MaduraDepartemen of Informatics, Faculty of Engineering, University of Trunojoyo MaduraDiabetes mellitus is a chronic disease often caused by high blood glucose levels and insufficient insulin production. This research aims to address the classification problem of diabetes mellitus using the K-Nearest Neighbor (K-NN) method. The aim of this research is to create a machine learning model that can detect diabetes early. The study was conducted at Syarifah Ambami Rato Ebu Hospital in Bangkalan, utilizing data from 120 patients in 2019, employing data mining techniques to classify diabetes mellitus patients. Additionally, the steps in data mining involve determining significant variables or features for classification Cleansing and normalization and transformation. The research compares training test results with ratios of 90:10, 80:20, and 70:30. Experimental results show that K-NN with a neighbor value of K=11 achieves the highest accuracy rate of 83% a reduced error rate of 16.67%, and the highest AUC value of 0.7407. These results indicate that the 90:10 data split ratio yields the best model performance in terms of accuracy and class differentiation for diabetes mellitus, as well as the lowest error rate compared to other data split ratios. This study provides a better understanding of diabetes mellitus and demonstrates that K-NN is effective in addressing classification problems, focusing on specific variables that influence the disease. Therefore, it can be concluded that K-Nearest Neighbor (K-NN) is a suitable algorithm for classifying diabetes mellitus.https://www.bio-conferences.org/articles/bioconf/pdf/2024/65/bioconf_btmic2024_01081.pdf
spellingShingle Putro Sigit Susanto
Putra Moh Abdan Syakura
Fatah Doni Abdul
Asmara Yuli Panca
Fauzan Hermawan Bin
Rochman Eka Mala Sari
Rachmad Aeri
Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value
BIO Web of Conferences
title Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value
title_full Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value
title_fullStr Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value
title_full_unstemmed Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value
title_short Classification of diabetes mellitus disease at Rato Ebuh Hospital-Indonesia using the K-Nearest neighbors method based on missing value
title_sort classification of diabetes mellitus disease at rato ebuh hospital indonesia using the k nearest neighbors method based on missing value
url https://www.bio-conferences.org/articles/bioconf/pdf/2024/65/bioconf_btmic2024_01081.pdf
work_keys_str_mv AT putrosigitsusanto classificationofdiabetesmellitusdiseaseatratoebuhhospitalindonesiausingtheknearestneighborsmethodbasedonmissingvalue
AT putramohabdansyakura classificationofdiabetesmellitusdiseaseatratoebuhhospitalindonesiausingtheknearestneighborsmethodbasedonmissingvalue
AT fatahdoniabdul classificationofdiabetesmellitusdiseaseatratoebuhhospitalindonesiausingtheknearestneighborsmethodbasedonmissingvalue
AT asmarayulipanca classificationofdiabetesmellitusdiseaseatratoebuhhospitalindonesiausingtheknearestneighborsmethodbasedonmissingvalue
AT fauzanhermawanbin classificationofdiabetesmellitusdiseaseatratoebuhhospitalindonesiausingtheknearestneighborsmethodbasedonmissingvalue
AT rochmanekamalasari classificationofdiabetesmellitusdiseaseatratoebuhhospitalindonesiausingtheknearestneighborsmethodbasedonmissingvalue
AT rachmadaeri classificationofdiabetesmellitusdiseaseatratoebuhhospitalindonesiausingtheknearestneighborsmethodbasedonmissingvalue