Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature Selection

Brain stroke stands out as a leading cause of death, distinguishing it from common illnesses and highlighting the critical need to utilize machine learning techniques to identify symptoms. Among these techniques, the Random Forest (RF) algorithm emerged as the main candidate because of its optimal a...

Full description

Saved in:
Bibliographic Details
Main Authors: Fachruddin Fachruddin, Errissya Rasywir, Yovi Pratama
Format: Article
Language:English
Published: Ikatan Ahli Informatika Indonesia 2024-08-01
Series:Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:
Online Access:https://jurnal.iaii.or.id/index.php/RESTI/article/view/5795
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544038878019584
author Fachruddin Fachruddin
Errissya Rasywir
Yovi Pratama
author_facet Fachruddin Fachruddin
Errissya Rasywir
Yovi Pratama
author_sort Fachruddin Fachruddin
collection DOAJ
description Brain stroke stands out as a leading cause of death, distinguishing it from common illnesses and highlighting the critical need to utilize machine learning techniques to identify symptoms. Among these techniques, the Random Forest (RF) algorithm emerged as the main candidate because of its optimal accuracy values. RF was chosen for its ensemble learning properties that optimize accuracy while simultaneously, bagging all outputs (DT), thus increasing its efficacy. Feature Selection, an important data analysis step, which is mainly achieved through pre-processing, aims to identify influential features and ignore less impactful features. Mutual Information serves as an important feature selection method. Specifically, the highest level of accuracy was achieved by cross-validating the test data - 10, resulting in 0.7760 without feature selection and 0.7790 with mutual information. Most of the attributes in the brain stroke dataset show relevance to the stroke disease class, but the resulting decision tree shows age as a particularly important node. So, the research results show that the selection feature (Mutual Information) can increase the accuracy of brain stroke classification, although it is not significant, namely an increase of 0.0030%. With an increase, where there is no significant difference, it can be said that almost all the attributes contained in the brain stroke dataset used have an influence on their relevance to the stroke disease class.
format Article
id doaj-art-4d19316fa81b45989139ac0876271f97
institution Kabale University
issn 2580-0760
language English
publishDate 2024-08-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling doaj-art-4d19316fa81b45989139ac0876271f972025-01-13T03:33:02ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602024-08-018455556210.29207/resti.v8i4.57955795Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature SelectionFachruddin Fachruddin0Errissya Rasywir1Yovi Pratama2Universitas Dinamika BangsaUniversitas Dinamika BangsaUniversitas Dinamika BangsaBrain stroke stands out as a leading cause of death, distinguishing it from common illnesses and highlighting the critical need to utilize machine learning techniques to identify symptoms. Among these techniques, the Random Forest (RF) algorithm emerged as the main candidate because of its optimal accuracy values. RF was chosen for its ensemble learning properties that optimize accuracy while simultaneously, bagging all outputs (DT), thus increasing its efficacy. Feature Selection, an important data analysis step, which is mainly achieved through pre-processing, aims to identify influential features and ignore less impactful features. Mutual Information serves as an important feature selection method. Specifically, the highest level of accuracy was achieved by cross-validating the test data - 10, resulting in 0.7760 without feature selection and 0.7790 with mutual information. Most of the attributes in the brain stroke dataset show relevance to the stroke disease class, but the resulting decision tree shows age as a particularly important node. So, the research results show that the selection feature (Mutual Information) can increase the accuracy of brain stroke classification, although it is not significant, namely an increase of 0.0030%. With an increase, where there is no significant difference, it can be said that almost all the attributes contained in the brain stroke dataset used have an influence on their relevance to the stroke disease class.https://jurnal.iaii.or.id/index.php/RESTI/article/view/5795random foreststrokesbrainmutual informationfeatures
spellingShingle Fachruddin Fachruddin
Errissya Rasywir
Yovi Pratama
Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature Selection
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
random forest
strokes
brain
mutual information
features
title Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature Selection
title_full Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature Selection
title_fullStr Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature Selection
title_full_unstemmed Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature Selection
title_short Increasing the Accuracy of Brain Stroke Classification using Random Forest Algorithm with Mutual Information Feature Selection
title_sort increasing the accuracy of brain stroke classification using random forest algorithm with mutual information feature selection
topic random forest
strokes
brain
mutual information
features
url https://jurnal.iaii.or.id/index.php/RESTI/article/view/5795
work_keys_str_mv AT fachruddinfachruddin increasingtheaccuracyofbrainstrokeclassificationusingrandomforestalgorithmwithmutualinformationfeatureselection
AT errissyarasywir increasingtheaccuracyofbrainstrokeclassificationusingrandomforestalgorithmwithmutualinformationfeatureselection
AT yovipratama increasingtheaccuracyofbrainstrokeclassificationusingrandomforestalgorithmwithmutualinformationfeatureselection