Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection

The use of credit cards in the modern era is increasing. Therefore, it is necessary to prevent it with the use of technology such as address verification systems (AVS), card verification methods (CVM), and personal identification Numbers (PIN). Dataset analysis needs to be carried out to analyze the...

Full description

Saved in:
Bibliographic Details
Main Authors: Arief Tri Arsanto, Arif Faizin, Moch lutfi, Zulfatun Nikmatus Saadah
Format: Article
Language:Indonesian
Published: Islamic University of Indragiri 2024-11-01
Series:Sistemasi: Jurnal Sistem Informasi
Online Access:https://sistemasi.ftik.unisi.ac.id/index.php/stmsi/article/view/4719
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841555765122301952
author Arief Tri Arsanto
Arif Faizin
Moch lutfi
Zulfatun Nikmatus Saadah
author_facet Arief Tri Arsanto
Arif Faizin
Moch lutfi
Zulfatun Nikmatus Saadah
author_sort Arief Tri Arsanto
collection DOAJ
description The use of credit cards in the modern era is increasing. Therefore, it is necessary to prevent it with the use of technology such as address verification systems (AVS), card verification methods (CVM), and personal identification Numbers (PIN). Dataset analysis needs to be carried out to analyze the history of transactions that have been carried out. In the fraud detection dataset, it can be seen that there are attributes that cause data imbalance. Class imbalance in a dataset is a significant problem in machine learning that can affect overall model performance. The number of majority samples is more significant in one class than the number of minority classes. This research used an oversampling approach using a combination of smote and tomek-link. The focus of this research is card fraud classification. Detection of imbalanced datasets or imbalanced classes is carried out using the Naive Bayes method as a classification algorithm. In addition, a combination of resampling techniques is also applied to overcome imbalanced classes in this dataset through the SMOTETomek approach. SMOTETomek is a method that reduces the number of samples by considering two adjacent data from the minority and majority classes. Meanwhile, from the problems above, the results of the performance of Naïve Bayes, which experienced issues with data imbalance in this study, a resampling method was proposed in the hope of improving the performance of the Naïve Bayes algorithm and in the results of the AUC ROC curve, the SMOTETomek method could improve the performance of the Naïve Bayes algorithm. The higher the ROC score. -AUC, the better the model performance in terms of its ability to differentiate between two classes, but the accuracy results do not experience a significant change.
format Article
id doaj-art-941042476d0d4fce84a5e2c4835bf9b4
institution Kabale University
issn 2302-8149
2540-9719
language Indonesian
publishDate 2024-11-01
publisher Islamic University of Indragiri
record_format Article
series Sistemasi: Jurnal Sistem Informasi
spelling doaj-art-941042476d0d4fce84a5e2c4835bf9b42025-01-08T03:10:27ZindIslamic University of IndragiriSistemasi: Jurnal Sistem Informasi2302-81492540-97192024-11-011362709272110.32520/stmsi.v13i6.4719924Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud DetectionArief Tri Arsanto0Arif Faizin1Moch lutfi2Zulfatun Nikmatus Saadah3Universitas Yudharta PasuruanUniversitas Yudharta PasuruanUniversitas Yudharta PasuruanUniversitas Yudharta PasuruanThe use of credit cards in the modern era is increasing. Therefore, it is necessary to prevent it with the use of technology such as address verification systems (AVS), card verification methods (CVM), and personal identification Numbers (PIN). Dataset analysis needs to be carried out to analyze the history of transactions that have been carried out. In the fraud detection dataset, it can be seen that there are attributes that cause data imbalance. Class imbalance in a dataset is a significant problem in machine learning that can affect overall model performance. The number of majority samples is more significant in one class than the number of minority classes. This research used an oversampling approach using a combination of smote and tomek-link. The focus of this research is card fraud classification. Detection of imbalanced datasets or imbalanced classes is carried out using the Naive Bayes method as a classification algorithm. In addition, a combination of resampling techniques is also applied to overcome imbalanced classes in this dataset through the SMOTETomek approach. SMOTETomek is a method that reduces the number of samples by considering two adjacent data from the minority and majority classes. Meanwhile, from the problems above, the results of the performance of Naïve Bayes, which experienced issues with data imbalance in this study, a resampling method was proposed in the hope of improving the performance of the Naïve Bayes algorithm and in the results of the AUC ROC curve, the SMOTETomek method could improve the performance of the Naïve Bayes algorithm. The higher the ROC score. -AUC, the better the model performance in terms of its ability to differentiate between two classes, but the accuracy results do not experience a significant change.https://sistemasi.ftik.unisi.ac.id/index.php/stmsi/article/view/4719
spellingShingle Arief Tri Arsanto
Arif Faizin
Moch lutfi
Zulfatun Nikmatus Saadah
Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection
Sistemasi: Jurnal Sistem Informasi
title Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection
title_full Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection
title_fullStr Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection
title_full_unstemmed Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection
title_short Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection
title_sort optimization of the naive bayes algorithm with smotetomek combination for imbalance class fraud detection
url https://sistemasi.ftik.unisi.ac.id/index.php/stmsi/article/view/4719
work_keys_str_mv AT arieftriarsanto optimizationofthenaivebayesalgorithmwithsmotetomekcombinationforimbalanceclassfrauddetection
AT ariffaizin optimizationofthenaivebayesalgorithmwithsmotetomekcombinationforimbalanceclassfrauddetection
AT mochlutfi optimizationofthenaivebayesalgorithmwithsmotetomekcombinationforimbalanceclassfrauddetection
AT zulfatunnikmatussaadah optimizationofthenaivebayesalgorithmwithsmotetomekcombinationforimbalanceclassfrauddetection