Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction

Diabetes Mellitus is a long-term medical disorder marked by high blood glucose levels that raise the risk of early mortality and organ failure. It has become an increasing global health problem, so making an accurate and timely diagnosis is urgently necessary. This study aims to diagnose people with...

Full description

Saved in:
Bibliographic Details
Main Authors: Aufar Faiq Fadhlullah, Triyanna Widiyaningtyas
Format: Article
Language:English
Published: Universitas Muhammadiyah Mataram 2024-10-01
Series:JTAM (Jurnal Teori dan Aplikasi Matematika)
Subjects:
Online Access:http://journal.ummat.ac.id/index.php/jtam/article/view/24388
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846166497739669504
author Aufar Faiq Fadhlullah
Triyanna Widiyaningtyas
author_facet Aufar Faiq Fadhlullah
Triyanna Widiyaningtyas
author_sort Aufar Faiq Fadhlullah
collection DOAJ
description Diabetes Mellitus is a long-term medical disorder marked by high blood glucose levels that raise the risk of early mortality and organ failure. It has become an increasing global health problem, so making an accurate and timely diagnosis is urgently necessary. This study aims to diagnose people with diabetes mellitus by utilizing prediction techniques in data mining using experimental research. The prediction stage for diagnosing diabetes consists of four stages: dataset collection, data pre-processing, data processing, and evaluation. Data was obtained from Electronic Health Records (EHRs), namely the public "Diabetes Prediction Dataset". The pre-processing stage involves data filtering, attribute conversion, and class selection. The data processing utilizes random forests and decision tree models for diabetes prediction. The models were evaluated using accuracy, precision, and recall metrics. The results showed that the Random Forest algorithm produced an accuracy value of 93.97%, precision of 99.88%, and recall of 66.56%, with a computational time of 16s. Meanwhile, the decision tree algorithm produces an accuracy value of 93.89%, precision of 98.73%, and recall of 66.88%, with a computation time of less than 1s. Based on these results, it can be concluded that the Decision Tree algorithm is more effective because the difference in accuracy, precision, and recall values produced by the two algorithms does not have significant differences. However, the Decision Tree algorithm has the advantage of using computational time more effectively, which is needed in detecting diabetes because it is related to someone's life.
format Article
id doaj-art-9ad3b75ee7ff4f38b84b9d8777c578d0
institution Kabale University
issn 2597-7512
2614-1175
language English
publishDate 2024-10-01
publisher Universitas Muhammadiyah Mataram
record_format Article
series JTAM (Jurnal Teori dan Aplikasi Matematika)
spelling doaj-art-9ad3b75ee7ff4f38b84b9d8777c578d02024-11-15T16:19:27ZengUniversitas Muhammadiyah MataramJTAM (Jurnal Teori dan Aplikasi Matematika)2597-75122614-11752024-10-01841121113210.31764/jtam.v8i4.243889993Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes PredictionAufar Faiq Fadhlullah0Triyanna Widiyaningtyas1Department of Electrical Engineering and Informatics, Universitas Negeri MalangDepartment of Electrical Engineering and Informatics, Universitas Negeri MalangDiabetes Mellitus is a long-term medical disorder marked by high blood glucose levels that raise the risk of early mortality and organ failure. It has become an increasing global health problem, so making an accurate and timely diagnosis is urgently necessary. This study aims to diagnose people with diabetes mellitus by utilizing prediction techniques in data mining using experimental research. The prediction stage for diagnosing diabetes consists of four stages: dataset collection, data pre-processing, data processing, and evaluation. Data was obtained from Electronic Health Records (EHRs), namely the public "Diabetes Prediction Dataset". The pre-processing stage involves data filtering, attribute conversion, and class selection. The data processing utilizes random forests and decision tree models for diabetes prediction. The models were evaluated using accuracy, precision, and recall metrics. The results showed that the Random Forest algorithm produced an accuracy value of 93.97%, precision of 99.88%, and recall of 66.56%, with a computational time of 16s. Meanwhile, the decision tree algorithm produces an accuracy value of 93.89%, precision of 98.73%, and recall of 66.88%, with a computation time of less than 1s. Based on these results, it can be concluded that the Decision Tree algorithm is more effective because the difference in accuracy, precision, and recall values produced by the two algorithms does not have significant differences. However, the Decision Tree algorithm has the advantage of using computational time more effectively, which is needed in detecting diabetes because it is related to someone's life.http://journal.ummat.ac.id/index.php/jtam/article/view/24388diabetes mellitusprediction algorithmrandom forestdecision tree.
spellingShingle Aufar Faiq Fadhlullah
Triyanna Widiyaningtyas
Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction
JTAM (Jurnal Teori dan Aplikasi Matematika)
diabetes mellitus
prediction algorithm
random forest
decision tree.
title Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction
title_full Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction
title_fullStr Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction
title_full_unstemmed Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction
title_short Comparative Analysis of Decision Tree and Random Forest Algorithms for Diabetes Prediction
title_sort comparative analysis of decision tree and random forest algorithms for diabetes prediction
topic diabetes mellitus
prediction algorithm
random forest
decision tree.
url http://journal.ummat.ac.id/index.php/jtam/article/view/24388
work_keys_str_mv AT aufarfaiqfadhlullah comparativeanalysisofdecisiontreeandrandomforestalgorithmsfordiabetesprediction
AT triyannawidiyaningtyas comparativeanalysisofdecisiontreeandrandomforestalgorithmsfordiabetesprediction