Feature importance analysis for spammer detection in Sina Weibo

Microblog has drawn attention of not only legitimate users but also spammers.The garbage information pro-vided by spammers handicaps users' experience significantly.In order to improve the detection accuracy of spammers,most existing studies on spam focus on generating more classification featu...

Full description

Saved in:
Bibliographic Details
Main Authors: Yu-xiang ZHANG, Yu SUN, Jia-hai YANG, Da-lei ZHOU, Xiang-fei MENG, Chun-jing XIAO
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2016-08-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2016152/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841539171160686592
author Yu-xiang ZHANG
Yu SUN
Jia-hai YANG
Da-lei ZHOU
Xiang-fei MENG
Chun-jing XIAO
author_facet Yu-xiang ZHANG
Yu SUN
Jia-hai YANG
Da-lei ZHOU
Xiang-fei MENG
Chun-jing XIAO
author_sort Yu-xiang ZHANG
collection DOAJ
description Microblog has drawn attention of not only legitimate users but also spammers.The garbage information pro-vided by spammers handicaps users' experience significantly.In order to improve the detection accuracy of spammers,most existing studies on spam focus on generating more classification features or putting forward new classifiers.Which kind of issues would be put the high priority of an enormous amount of research effort into? Are extensive features or novel classifiers better for the detection accuracy of spammers? It is tried to address these questions through combining different feature selection methods with different classifiers on a real Sina Weibo dataset.Experimental results show that selected features are more important than novel classifiers for spammer detection.In addition,features should be derived from a wide range,such as text contents,user behaviors,and social relationship,and the dimension of features should not be too high.These results will be useful in finding the breakpoint of Microblog anti-spam works in the future.
format Article
id doaj-art-120c642e43df4e8e96501124301527ef
institution Kabale University
issn 1000-436X
language zho
publishDate 2016-08-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-120c642e43df4e8e96501124301527ef2025-01-14T07:25:29ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2016-08-0137243359702584Feature importance analysis for spammer detection in Sina WeiboYu-xiang ZHANGYu SUNJia-hai YANGDa-lei ZHOUXiang-fei MENGChun-jing XIAOMicroblog has drawn attention of not only legitimate users but also spammers.The garbage information pro-vided by spammers handicaps users' experience significantly.In order to improve the detection accuracy of spammers,most existing studies on spam focus on generating more classification features or putting forward new classifiers.Which kind of issues would be put the high priority of an enormous amount of research effort into? Are extensive features or novel classifiers better for the detection accuracy of spammers? It is tried to address these questions through combining different feature selection methods with different classifiers on a real Sina Weibo dataset.Experimental results show that selected features are more important than novel classifiers for spammer detection.In addition,features should be derived from a wide range,such as text contents,user behaviors,and social relationship,and the dimension of features should not be too high.These results will be useful in finding the breakpoint of Microblog anti-spam works in the future.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2016152/Sina Weibofeature definitionfeature selectionspammer detection
spellingShingle Yu-xiang ZHANG
Yu SUN
Jia-hai YANG
Da-lei ZHOU
Xiang-fei MENG
Chun-jing XIAO
Feature importance analysis for spammer detection in Sina Weibo
Tongxin xuebao
Sina Weibo
feature definition
feature selection
spammer detection
title Feature importance analysis for spammer detection in Sina Weibo
title_full Feature importance analysis for spammer detection in Sina Weibo
title_fullStr Feature importance analysis for spammer detection in Sina Weibo
title_full_unstemmed Feature importance analysis for spammer detection in Sina Weibo
title_short Feature importance analysis for spammer detection in Sina Weibo
title_sort feature importance analysis for spammer detection in sina weibo
topic Sina Weibo
feature definition
feature selection
spammer detection
url http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2016152/
work_keys_str_mv AT yuxiangzhang featureimportanceanalysisforspammerdetectioninsinaweibo
AT yusun featureimportanceanalysisforspammerdetectioninsinaweibo
AT jiahaiyang featureimportanceanalysisforspammerdetectioninsinaweibo
AT daleizhou featureimportanceanalysisforspammerdetectioninsinaweibo
AT xiangfeimeng featureimportanceanalysisforspammerdetectioninsinaweibo
AT chunjingxiao featureimportanceanalysisforspammerdetectioninsinaweibo