Spam detection using hybrid model on fusion of spammer behavior and linguistics features

E-commerce sites, forums, and blogs have become popular platforms for people to share their views. Reviews have emerged as a crucial source of information for potential customers, influencing their purchasing decisions. Similarly for gaining profit or fame, Spam reviews are deliberately written with...

Full description

Saved in:
Bibliographic Details
Main Authors: Amna Iqbal, Muhammad Younas, Saman Iftikhar, Fakeeha Fatima, Rabia Saleem
Format: Article
Language:English
Published: Elsevier 2025-03-01
Series:Egyptian Informatics Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1110866524001683
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841560594530959360
author Amna Iqbal
Muhammad Younas
Saman Iftikhar
Fakeeha Fatima
Rabia Saleem
author_facet Amna Iqbal
Muhammad Younas
Saman Iftikhar
Fakeeha Fatima
Rabia Saleem
author_sort Amna Iqbal
collection DOAJ
description E-commerce sites, forums, and blogs have become popular platforms for people to share their views. Reviews have emerged as a crucial source of information for potential customers, influencing their purchasing decisions. Similarly for gaining profit or fame, Spam reviews are deliberately written with the intention of defaming businesses or individuals. This act is known as review spamming. Spam review detection is rapidly answered by various ML techniques. Review of spamming is more challenging task in multilingual communities. Spammer behavior features and linguistic features often exhibit complex relationships that influence the nature of spam reviews. The unified representation of features is another challenging task in spam detection. Various deep learning approaches have been proposed for review spamming, including different neural networks (Convolutional Neural Network, CNN). These methods are specialized in extracting the features but lack to capture feature dependencies effectively with other features. Spam Review Detection using the Fusion Gradient Boosting (GB) Model and Support Vector Machine (SVM) (Hybrid-BoostSVM) is proposed with fusion of spammer behavior features and linguistic features to automatically detect and classify the spam reviews. Fusion enables the proposed model to automatically learn the interactions between the features during the training process, allowing it to capture complex relationships and make predictions based on both types of features. It apparently shows the promising result by obtaining 94.6 % accuracy.
format Article
id doaj-art-e64d67bf491e4041820a741afc0b1e1c
institution Kabale University
issn 1110-8665
language English
publishDate 2025-03-01
publisher Elsevier
record_format Article
series Egyptian Informatics Journal
spelling doaj-art-e64d67bf491e4041820a741afc0b1e1c2025-01-04T04:56:03ZengElsevierEgyptian Informatics Journal1110-86652025-03-0129100605Spam detection using hybrid model on fusion of spammer behavior and linguistics featuresAmna Iqbal0Muhammad Younas1Saman Iftikhar2Fakeeha Fatima3Rabia Saleem4Computer Science at Government College University Faisalabad, PakistanDepartment of Information Technology, Government College University Faisalabad, Pakistan; Corresponding author.Faculty of Computer Studies, Arab Open University, Riyadh 11681, Saudi ArabiaDepartment of Computer Science, Govt. Islamia Graduate College (W), Eidgah Road, Faisalabad, PakistanDepartment of Information Technology, Government College University Faisalabad, PakistanE-commerce sites, forums, and blogs have become popular platforms for people to share their views. Reviews have emerged as a crucial source of information for potential customers, influencing their purchasing decisions. Similarly for gaining profit or fame, Spam reviews are deliberately written with the intention of defaming businesses or individuals. This act is known as review spamming. Spam review detection is rapidly answered by various ML techniques. Review of spamming is more challenging task in multilingual communities. Spammer behavior features and linguistic features often exhibit complex relationships that influence the nature of spam reviews. The unified representation of features is another challenging task in spam detection. Various deep learning approaches have been proposed for review spamming, including different neural networks (Convolutional Neural Network, CNN). These methods are specialized in extracting the features but lack to capture feature dependencies effectively with other features. Spam Review Detection using the Fusion Gradient Boosting (GB) Model and Support Vector Machine (SVM) (Hybrid-BoostSVM) is proposed with fusion of spammer behavior features and linguistic features to automatically detect and classify the spam reviews. Fusion enables the proposed model to automatically learn the interactions between the features during the training process, allowing it to capture complex relationships and make predictions based on both types of features. It apparently shows the promising result by obtaining 94.6 % accuracy.http://www.sciencedirect.com/science/article/pii/S1110866524001683Review spammingLinguistic featuresSpammer behavior featuresClassificationFeature engineeringSVM
spellingShingle Amna Iqbal
Muhammad Younas
Saman Iftikhar
Fakeeha Fatima
Rabia Saleem
Spam detection using hybrid model on fusion of spammer behavior and linguistics features
Egyptian Informatics Journal
Review spamming
Linguistic features
Spammer behavior features
Classification
Feature engineering
SVM
title Spam detection using hybrid model on fusion of spammer behavior and linguistics features
title_full Spam detection using hybrid model on fusion of spammer behavior and linguistics features
title_fullStr Spam detection using hybrid model on fusion of spammer behavior and linguistics features
title_full_unstemmed Spam detection using hybrid model on fusion of spammer behavior and linguistics features
title_short Spam detection using hybrid model on fusion of spammer behavior and linguistics features
title_sort spam detection using hybrid model on fusion of spammer behavior and linguistics features
topic Review spamming
Linguistic features
Spammer behavior features
Classification
Feature engineering
SVM
url http://www.sciencedirect.com/science/article/pii/S1110866524001683
work_keys_str_mv AT amnaiqbal spamdetectionusinghybridmodelonfusionofspammerbehaviorandlinguisticsfeatures
AT muhammadyounas spamdetectionusinghybridmodelonfusionofspammerbehaviorandlinguisticsfeatures
AT samaniftikhar spamdetectionusinghybridmodelonfusionofspammerbehaviorandlinguisticsfeatures
AT fakeehafatima spamdetectionusinghybridmodelonfusionofspammerbehaviorandlinguisticsfeatures
AT rabiasaleem spamdetectionusinghybridmodelonfusionofspammerbehaviorandlinguisticsfeatures