The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection

The pervasive issue of fake news spreading rapidly on online platforms. causing a concerning dissemination of misinformation. The influence of fake news has become a pressing social problem, shaping public opinion in important events such as elections. This research focuses on detecting and classify...

Full description

Saved in:
Bibliographic Details
Main Authors: Dhani Ariatmanto, Anggi Muhammad Rifai
Format: Article
Language:English
Published: Ikatan Ahli Informatika Indonesia 2024-12-01
Series:Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:
Online Access:https://jurnal.iaii.or.id/index.php/RESTI/article/view/6017
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544031161548800
author Dhani Ariatmanto
Anggi Muhammad Rifai
author_facet Dhani Ariatmanto
Anggi Muhammad Rifai
author_sort Dhani Ariatmanto
collection DOAJ
description The pervasive issue of fake news spreading rapidly on online platforms. causing a concerning dissemination of misinformation. The influence of fake news has become a pressing social problem, shaping public opinion in important events such as elections. This research focuses on detecting and classifying fake news using the Random Forest algorithm by investigating the impact of feature extraction techniques on classification accuracy, this study specifically employs the TF-IDF method. For this purpose, we used 44,898 English-language articles from the ISOT fake news dataset. The dataset is cleaned using tokenization and stemming then split into 75% training and 25% testing. The TF-IDF vectorizer technique was applied to convert text into numeric as feature extraction. This study has implemented a Random Forest classifier to predict real and fake news. The proposed model contributes to overall classification precision by comparing it to the existing models. This fake news detection highlights the efficacy of the TF-IDF vectorizer and Random Forest combination which achieved an impressive accuracy rate of 99.0%. This contribution highlights an effective strategy for combating misinformation through precise text classification.
format Article
id doaj-art-01dc41a0a7a5435db9002daac7dae7ec
institution Kabale University
issn 2580-0760
language English
publishDate 2024-12-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling doaj-art-01dc41a0a7a5435db9002daac7dae7ec2025-01-13T03:30:32ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602024-12-018673073610.29207/resti.v8i6.60176017The Impact of Feature Extraction in Random Forest Classifier for Fake News DetectionDhani Ariatmanto0Anggi Muhammad Rifai1Universitas AMIKOM YogyakartaUniversitas Pelita BangsaThe pervasive issue of fake news spreading rapidly on online platforms. causing a concerning dissemination of misinformation. The influence of fake news has become a pressing social problem, shaping public opinion in important events such as elections. This research focuses on detecting and classifying fake news using the Random Forest algorithm by investigating the impact of feature extraction techniques on classification accuracy, this study specifically employs the TF-IDF method. For this purpose, we used 44,898 English-language articles from the ISOT fake news dataset. The dataset is cleaned using tokenization and stemming then split into 75% training and 25% testing. The TF-IDF vectorizer technique was applied to convert text into numeric as feature extraction. This study has implemented a Random Forest classifier to predict real and fake news. The proposed model contributes to overall classification precision by comparing it to the existing models. This fake news detection highlights the efficacy of the TF-IDF vectorizer and Random Forest combination which achieved an impressive accuracy rate of 99.0%. This contribution highlights an effective strategy for combating misinformation through precise text classification.https://jurnal.iaii.or.id/index.php/RESTI/article/view/6017fake newsrandom foresttext classificationmachine learningfeature extraction
spellingShingle Dhani Ariatmanto
Anggi Muhammad Rifai
The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
fake news
random forest
text classification
machine learning
feature extraction
title The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection
title_full The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection
title_fullStr The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection
title_full_unstemmed The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection
title_short The Impact of Feature Extraction in Random Forest Classifier for Fake News Detection
title_sort impact of feature extraction in random forest classifier for fake news detection
topic fake news
random forest
text classification
machine learning
feature extraction
url https://jurnal.iaii.or.id/index.php/RESTI/article/view/6017
work_keys_str_mv AT dhaniariatmanto theimpactoffeatureextractioninrandomforestclassifierforfakenewsdetection
AT anggimuhammadrifai theimpactoffeatureextractioninrandomforestclassifierforfakenewsdetection
AT dhaniariatmanto impactoffeatureextractioninrandomforestclassifierforfakenewsdetection
AT anggimuhammadrifai impactoffeatureextractioninrandomforestclassifierforfakenewsdetection