Feature importance analysis for spammer detection in Sina Weibo
Microblog has drawn attention of not only legitimate users but also spammers.The garbage information pro-vided by spammers handicaps users' experience significantly.In order to improve the detection accuracy of spammers,most existing studies on spam focus on generating more classification featu...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2016-08-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2016152/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841539171160686592 |
---|---|
author | Yu-xiang ZHANG Yu SUN Jia-hai YANG Da-lei ZHOU Xiang-fei MENG Chun-jing XIAO |
author_facet | Yu-xiang ZHANG Yu SUN Jia-hai YANG Da-lei ZHOU Xiang-fei MENG Chun-jing XIAO |
author_sort | Yu-xiang ZHANG |
collection | DOAJ |
description | Microblog has drawn attention of not only legitimate users but also spammers.The garbage information pro-vided by spammers handicaps users' experience significantly.In order to improve the detection accuracy of spammers,most existing studies on spam focus on generating more classification features or putting forward new classifiers.Which kind of issues would be put the high priority of an enormous amount of research effort into? Are extensive features or novel classifiers better for the detection accuracy of spammers? It is tried to address these questions through combining different feature selection methods with different classifiers on a real Sina Weibo dataset.Experimental results show that selected features are more important than novel classifiers for spammer detection.In addition,features should be derived from a wide range,such as text contents,user behaviors,and social relationship,and the dimension of features should not be too high.These results will be useful in finding the breakpoint of Microblog anti-spam works in the future. |
format | Article |
id | doaj-art-120c642e43df4e8e96501124301527ef |
institution | Kabale University |
issn | 1000-436X |
language | zho |
publishDate | 2016-08-01 |
publisher | Editorial Department of Journal on Communications |
record_format | Article |
series | Tongxin xuebao |
spelling | doaj-art-120c642e43df4e8e96501124301527ef2025-01-14T07:25:29ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2016-08-0137243359702584Feature importance analysis for spammer detection in Sina WeiboYu-xiang ZHANGYu SUNJia-hai YANGDa-lei ZHOUXiang-fei MENGChun-jing XIAOMicroblog has drawn attention of not only legitimate users but also spammers.The garbage information pro-vided by spammers handicaps users' experience significantly.In order to improve the detection accuracy of spammers,most existing studies on spam focus on generating more classification features or putting forward new classifiers.Which kind of issues would be put the high priority of an enormous amount of research effort into? Are extensive features or novel classifiers better for the detection accuracy of spammers? It is tried to address these questions through combining different feature selection methods with different classifiers on a real Sina Weibo dataset.Experimental results show that selected features are more important than novel classifiers for spammer detection.In addition,features should be derived from a wide range,such as text contents,user behaviors,and social relationship,and the dimension of features should not be too high.These results will be useful in finding the breakpoint of Microblog anti-spam works in the future.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2016152/Sina Weibofeature definitionfeature selectionspammer detection |
spellingShingle | Yu-xiang ZHANG Yu SUN Jia-hai YANG Da-lei ZHOU Xiang-fei MENG Chun-jing XIAO Feature importance analysis for spammer detection in Sina Weibo Tongxin xuebao Sina Weibo feature definition feature selection spammer detection |
title | Feature importance analysis for spammer detection in Sina Weibo |
title_full | Feature importance analysis for spammer detection in Sina Weibo |
title_fullStr | Feature importance analysis for spammer detection in Sina Weibo |
title_full_unstemmed | Feature importance analysis for spammer detection in Sina Weibo |
title_short | Feature importance analysis for spammer detection in Sina Weibo |
title_sort | feature importance analysis for spammer detection in sina weibo |
topic | Sina Weibo feature definition feature selection spammer detection |
url | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2016152/ |
work_keys_str_mv | AT yuxiangzhang featureimportanceanalysisforspammerdetectioninsinaweibo AT yusun featureimportanceanalysisforspammerdetectioninsinaweibo AT jiahaiyang featureimportanceanalysisforspammerdetectioninsinaweibo AT daleizhou featureimportanceanalysisforspammerdetectioninsinaweibo AT xiangfeimeng featureimportanceanalysisforspammerdetectioninsinaweibo AT chunjingxiao featureimportanceanalysisforspammerdetectioninsinaweibo |