The Effect of Resampling Techniques on Model Performance Classification of Maternal Health Risks

Indonesia's maternal mortality rate was the second highest in ASEAN, reflecting the problem of class imbalance in maternal health data. This research aimed to improve prediction accuracy in the classification of pregnant women's diseases through the application of various resampling method...

Full description

Saved in:
Bibliographic Details
Main Authors: Nia Mauliza, Aisha Shakila Iedwan, Yoga Pristyanto, Anggit Dwi Hartanto, Arif Nur Rohman
Format: Article
Language:English
Published: Ikatan Ahli Informatika Indonesia 2024-08-01
Series:Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:
Online Access:https://jurnal.iaii.or.id/index.php/RESTI/article/view/5934
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544042213539840
author Nia Mauliza
Aisha Shakila Iedwan
Yoga Pristyanto
Anggit Dwi Hartanto
Arif Nur Rohman
author_facet Nia Mauliza
Aisha Shakila Iedwan
Yoga Pristyanto
Anggit Dwi Hartanto
Arif Nur Rohman
author_sort Nia Mauliza
collection DOAJ
description Indonesia's maternal mortality rate was the second highest in ASEAN, reflecting the problem of class imbalance in maternal health data. This research aimed to improve prediction accuracy in the classification of pregnant women's diseases through the application of various resampling methods. The methods used in this research included Synthetic Minority Over-sampling Technique (SMOTE), SMOTE-Edited Nearest Neighbor (SMOTE-ENN), Adaptive Synthetic Sampling (ADASYN), and ADASYN-ENN, using five classification algorithms: Decision Tree, K-Nearest Neighbor (KNN), Naïve Bayes, Random Forest, and Support Vector Machine (SVM). Performance evaluation was carried out using accuracy, precision, recall, and F1-score metrics to determine the best method and algorithm. The results showed that the SMOTE-ENN and ADASYN-ENN methods significantly improved the model's performance in predicting maternal disease. Random Forest and Decision Tree algorithms showed the best results in terms of accuracy and consistency. These findings provided practical guidance for the application of resampling techniques in the classification of pregnant women's health data, which could contribute to improving the quality of maternal health services in Indonesia.
format Article
id doaj-art-5f9dd238116242c0a32bfe95e2aba35b
institution Kabale University
issn 2580-0760
language English
publishDate 2024-08-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling doaj-art-5f9dd238116242c0a32bfe95e2aba35b2025-01-13T03:33:02ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602024-08-018449650510.29207/resti.v8i4.59345934The Effect of Resampling Techniques on Model Performance Classification of Maternal Health RisksNia Mauliza0Aisha Shakila Iedwan1Yoga Pristyanto2Anggit Dwi Hartanto3Arif Nur Rohman4Universitas Amikom YogyakartaUniversitas Amikom YogyakartaUniversitas Amikom YogyakartaUniversitas Amikom YogyakartaUniversitas Amikom YogyakartaIndonesia's maternal mortality rate was the second highest in ASEAN, reflecting the problem of class imbalance in maternal health data. This research aimed to improve prediction accuracy in the classification of pregnant women's diseases through the application of various resampling methods. The methods used in this research included Synthetic Minority Over-sampling Technique (SMOTE), SMOTE-Edited Nearest Neighbor (SMOTE-ENN), Adaptive Synthetic Sampling (ADASYN), and ADASYN-ENN, using five classification algorithms: Decision Tree, K-Nearest Neighbor (KNN), Naïve Bayes, Random Forest, and Support Vector Machine (SVM). Performance evaluation was carried out using accuracy, precision, recall, and F1-score metrics to determine the best method and algorithm. The results showed that the SMOTE-ENN and ADASYN-ENN methods significantly improved the model's performance in predicting maternal disease. Random Forest and Decision Tree algorithms showed the best results in terms of accuracy and consistency. These findings provided practical guidance for the application of resampling techniques in the classification of pregnant women's health data, which could contribute to improving the quality of maternal health services in Indonesia.https://jurnal.iaii.or.id/index.php/RESTI/article/view/5934class imbalanceresampling methodsclassification algorithmsmaternal healthprediction accuracymachine learning
spellingShingle Nia Mauliza
Aisha Shakila Iedwan
Yoga Pristyanto
Anggit Dwi Hartanto
Arif Nur Rohman
The Effect of Resampling Techniques on Model Performance Classification of Maternal Health Risks
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
class imbalance
resampling methods
classification algorithms
maternal health
prediction accuracy
machine learning
title The Effect of Resampling Techniques on Model Performance Classification of Maternal Health Risks
title_full The Effect of Resampling Techniques on Model Performance Classification of Maternal Health Risks
title_fullStr The Effect of Resampling Techniques on Model Performance Classification of Maternal Health Risks
title_full_unstemmed The Effect of Resampling Techniques on Model Performance Classification of Maternal Health Risks
title_short The Effect of Resampling Techniques on Model Performance Classification of Maternal Health Risks
title_sort effect of resampling techniques on model performance classification of maternal health risks
topic class imbalance
resampling methods
classification algorithms
maternal health
prediction accuracy
machine learning
url https://jurnal.iaii.or.id/index.php/RESTI/article/view/5934
work_keys_str_mv AT niamauliza theeffectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT aishashakilaiedwan theeffectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT yogapristyanto theeffectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT anggitdwihartanto theeffectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT arifnurrohman theeffectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT niamauliza effectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT aishashakilaiedwan effectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT yogapristyanto effectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT anggitdwihartanto effectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks
AT arifnurrohman effectofresamplingtechniquesonmodelperformanceclassificationofmaternalhealthrisks